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Protein/(Poly)peptide Libraries 



* Field of the Invention 

j& The present invention relates to synthetic DNA sequences which encode one or 

more collections of homologous proteins/(po!y)peptides, and methods for 
generating and applying libraries of these DNA sequences. In particular, the 
invention relates to the preparation of a library of human-derived antibody genes by 
the use of synthetic consensus sequences which cover the structural repertoire of 
antibodies encoded in the human genome. Furthermore, the invention relates to the 
use of a single consensus antibody gene as a universal framework for highly 
diverse antibody libraries. 

Background to the Invention 

All current recombinant methods which use libraries of proteins/(poty)peptides, e.g. 
antibodies, to screen for members with desired properties, e.g. binding a given 
ligand, do not provide the possibility to improve the desired properties of the 
members in an easy and rapid manner. Usually a library is created either by 
inserting a random oligonucleotide sequence into one or more DNA sequences 
cloned from an organism, or a family of DNA sequences is cloned and used as the 
library. The library is then screened, e.g. using phage display, for members which 
show the desired property. The sequences of one or more of these resulting 
molecules are then determined. There is no general procedure available to improve 
these molecules further on. 

Winter (EP 0 368 684 B1) has provided a method for amplifying (by PCR), cloning, 
and expressing antibody variable region genes. Starting with these genes he was 
able to create libraries of functional antibody fragments by randomizing the CDR3 of 
the heavy and/or the light chain. This process is functionally equivalent to the natural 
process of VJ and VDJ recombination which occurs during the development of B- 
cells in the immune system. 

However the Winter invention does not provide a method for optimizing the binding 

* affinities of antibody fragments further on, a process which would be functionally 
equivalent to the naturally occurring phenomenon of "affinity maturation", which is 

> provided by the present invention. Furthermore, the Winter invention does not 

provide for artificial variable region genes, which represent a whole family of 
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structurally similar natural genes, and which can be assembled from syrUhe'c DNA 
oligonucleotides. Additionally. Winter does not enable the combinatorial assembly 
of porfons of antibody variable regions, a feature which is provided by the present 
.nvent.on. Furthermore, this approach has the disadvantage that the genes of all 
ant.bod.es obtained in the screening procedure have to be completely sequenced 
since except for the PGR priming regions, no additional sequence information aboui 
he horary members is available. This is time and labor intensive and potentially 
leads to sequencing errors. 

The leaching of Winter as we,, as other approaches have tried ,o create ,ar S e 
anybody hbranes hav|ng high diversity in ,he complementary determining regions 
(CDRs) as wetl as in the frameworks to be able to find antibodies against as Iny 
drfferen, anngens as possible. „ has been suggested tha, a single universe 
framewo, may be useful to build antibody Hbraries, bu, no approach has ye. been 
successful. 

Another problem ,ies in the production o, reagents derived from antibodies. Small 

reaoenis andT^ T '"""^ ** aS ,herapeutic «°9™<* 

reagen, s . and , or b,ochem,cal research. Thus. Ihey are needed in large amounts 

and the expressron of antibody fragments, e.g. Fv, single-chain Fv (scFv,. or Fab in 

the periplasm of E co,i (Skerra S Pluckthun. 1988; Better et a!.. 1988, is now used 

rout.nely ,n many laboratories. Expression yields vary widely, however. White some 

fragments y,e,d up to several mg of functional, soluble protein per liter and OD of 

culture broth ,n shake flask culture (Carter et al„ 1992, Pluckthun et al. 1 996) other 

fragments may almost exclusively lead to insoluble material, often found in so-'called 

inclusion bod.es. Functional protein may be obtained from the latter in modes, yields 

by a laborious and time-consuming refolding process. The factors influencing 

st™ or*'"" S S "'" 0n ' y Undere,00d ' FoWi "8 and 

stabmtyofthe an.,body fragments, protease .ability and toxicity of the expressed 
proteins to the host cells often severely limit actua, production Lets, 
attempts have been tried to increase expression yields. For example Knapoik S 
PDckthun ,1995, cou.d show ,ha, expression yield depends on ^e an d 
sequence. They identified key residues in the antibody framework which influence 
« xpression yields drama,ica,,y. Similarly, u,lrich e, a,. (1995) found tha, poin, 
mu,a„ons ,n the CDRs can increase ,he yields in periplasm, antibody fragmen 

zr:rw r erthe,ess ' ,hese s,ra,e9ies are oniy <° * - * 

Since ,he Winter ,nven„on uses existing repertoires o, antibodies, no influence on 
expressibilrty of the genes is possible. 
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Furthermore, the findings of Knappik & Pluckthun and Ullrich demonstrate that the 
knowledge about antibodies, especially about folding and expression is still 
increasing. The Winter invention does not allow to incorporate such improvements 
into the library design. 

The expressibility of the genes is important for the library quality as well, since the 
screening procedure relies in most cases on the display of the gene product on a 
phage surface, and efficient display relies on at least moderate expression of the 
gene. 

These disadvantages of the existing methodologies are overcome by the present 
invention, which is applicable for all collections of homologous proteins. It has the 
following novel and useful features illustrated in the following by antibodies as an 
example: 

Artificial antibodies and fragments thereof can be constructed based on known 
antibody sequences, which reflect the structural properties of a whole group of 
homologous antibody genes. Therefore it is possible to reduce the number of 
different genes without any loss in the structural repertoire. This approach leads to a 
limited set of artificial genes, which can be synthesized de novo, thereby allowing 
introduction of cleavage sites and removing unwanted cleavages sites. Furthermore, 
this approach enables (i), adapting the codon usage of the genes to that of highly 
expressed genes in any desired host cell and (ii), analyzing all possible pairs of 
antibody light (L) and heavy (H) chains in terms of interaction preference, antigen 
preference or recombinant expression titer, which is virtually impossible using the 
complete collection of antibody genes of an organism and all combinations thereof. 

The use of a limited set of completely synthetic genes makes it possible to create 
cleavage sites at the boundaries of encoded structural sub-elements. Therefore, 
each gene is built up from modules which represent structural sub-elements on the 
protein/(poly)peptide level, in the case of antibodies, the modules consist of 
"framework" and "CDR" modules. By creating separate framework and CDFt 
modules, different combinatorial assembly possibilities are enabled. Moreover, if 
two or more artificial genes carry identical pairs of cleavage sites at the boundaries 
of each of the genetic sub-elements, pre-built libraries of sub-elements can be 
inserted in these genes simultaneously, without any additional information related to 
any particular gene sequence. This strategy enables rapid optimization of, for 
example, antibody affinity, since DNA cassettes encoding libraries of genetic sub- 
elements can be (i), pre-built, stored and reused and (ii), inserted in any of these 
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sequences at the right position without knowing th3 actual sequence or having to 
determine the sequence of the individual library member. 

Additionally, new information about amino acid residues important for binding, 
stability, or solubility and expression could be integrated into the library design by 
replacing existing modules with modules modified according to the new 
observations. 

The limited number of consensus sequences used for creating the library allows to 
speed up the identification of binding antibodies after screening. After having 
identified the underlying consensus gene sequence, which could be done, by 
sequencing or by using fingerprint restriction sites, just those part(s) comprising the 
random sequence(s) have to be determined. This reduces the probability of 
sequencing errors and of false-positive results. 

The above mentioned cleavage sites can be used only if they are unique in the 
vector system where the artificial genes have been inserted. As a result, the vector 
has to be modified to contain none of these cleavage sites. The construction of a 
vector consisting of basic elements like resistance gene and origin of replication, 
where cleavage sites have been removed, is of general interest for many cloning 
attempts. Additionally, these vector(s) could be part of a kit comprising the above 
mentioned artificial genes and pre-built libraries. 

The collection of artificial genes can be used for a rapid humanization procedure of 
non-human antibodies, preferably .of rodent antibodies. First, the amino acid 
sequence of the non-human, preferably rodent antibody is compared with the amino 
acid sequences encoded by the collection of artificial genes to determine the most 
homologous light and heavy framework regions. These genes are then used for 
insertion of the genetic sub-elements encoding the CDRs of the non-human, 
preferably rodent antibody. 

Surprisingly, it has been found that with a combination of only one consensus 
sequence for each of the light and heavy chains of a scFv fragment an antibody 
repertoire could be created yielding antibodies againsl virtually every antigen. 
Therefore, one aspect of the present invention is the use of a single consensus 
sequence as a universal framework for the creation of useful (poly)peptide libraries 
and antibody consensus sequences useful therefor. 
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The present invention enables the creation of useful libraries of (poly)peptides. In a 
first embodiment, the invention provides for a method of setting up nucleic acid 
sequences suitable for the creation of said libraries. In a first step, a collection of at 
least three homologous proteins is identified and then analyzed. Therefore, a 
database of the protein sequences is established where the protein sequences are 
aligned to each other. The database is used to define subgroups of protein 
sequences which show a high degree of similarity in both the sequence and, if 
information is available, in the structural arrangement. For each of the subgroups a 
(poly)peptide sequence comprising at least one consensus sequence is deduced 
which represents the members of this subgroup; the complete collection of 
(poly)peptide sequences represent therefore the complete structural repertoire of 
the collection of homologous proteins. These artificial (poly)peptide sequences are 
then analyzed, if possible, according to their structural properties to identify 
unfavorable interactions between amino acids within said (poly)peptide sequences 
or between said or other (poly)peptide sequences, for example, in multimeric 
proteins. Such interactions are then removed by changing the consensus sequence 
accordingly. The (poly)peptide sequences are then analyzed to identify sub- 
elements such as domains, loops, helices or CDRs. The amino acid sequence is 
backtranslated into a corresponding coding nucleic acid sequence which is adapted 
to the codon usage of the host planned for expressing said nucleic acid sequences. 
A set of cleavage sites is set up in a way that each of the sub-sequences encoding 
the sub-elements identified as described above, is flanked by two sites which do not 
occur a second time within the nucleic acid sequence. This can be achieved by 
either identifying a cleavage site already flanking a sub-sequence of by changing 
one or more nucleotides to create the cleavage site, and by removing that site from 
the remaining part of the gene. The cleavage sites should be common to all 
corresponding sub-elements or sub-sequences, thus creating a fully modular 
arrangement of the sub-sequences in the nucleic acid sequence and of the sub- 
elements in the corresponding (poly)peptide. 

In a further embodiment, the invention provides for a method which sets up two or 
more sets of (poly)peptides, where for each set the method as described above is 
performed, and where the cleavage sites are not only unique within each set but 
also between any two sets. This method can be applied for the creation of 
(poly)peptide libraries comprising for example two a-helical domains from two 
different proteins, where said library is screened for novel hetero-association 
domains. 
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In yet a further embodiment, at least two of the sets as described above, are derived 
from the same collection of proteins or at least a part of it. This describes libraries 
comprising for example, but not limited to, two domains from antibodies such as VH 
and VL, or two extracellular loops of transmembrane receptors. 

In another embodiment, the nucleic acid sequences set up as described above, are 
synthesized. This can be achieved by any one of several methods well known to the 
practitioner skilled in the art, for example, by total gene synthesis or by PCR-based 
approaches. 

In one embodiment, the nucleic acid sequences are cloned into a vector. The vector 
could be a sequencing vector, an expression vector or a display (e.g. phage display) 
vector, which are well known to those skilled in the art. Any vector could comprise 
one nucleic acid sequence, or two or more nucleic sequences, either in different or 
the same operon. In the last case, they could either be cloned separately or as 
contiguous sequences. 

In one embodiment, the removal of unfavorable interactions as described above, 
leads to enhanced expression of the modified (poly)peptides. 

In a preferred embodiment, one or more sub-sequences of the nucleic acid 
sequences are replaced by different sequences. This can be achieved by excising 
the sub-sequences using the conditions suitable for cleaving the cleavage sites 
adjacent to or at the end of the subsequence, for example, by using a restriction 
enzyme at the corresponding restriction site under the conditions well known to 
those skilled in the art, and replacing the sub-sequence by a different sequence 
compatible with the cleaved nucleic acid sequence. In a further preferred 
embodiment, the different sequences replacing the initial sub-sequence(s) are 
genomic or rearranged genomic sequences, for example in grafting CDRs from non- 
human antibodies onto consensus antibody sequences for rapid humanization of 
non-human antibodies. In the most preferred embodiment, the different sequences 
are random sequences, thus replacing the sub-sequence by a collection of 
sequences to introduce variability and to create a library. The random sequences 
can be assembled in various ways, for example by using a mixture of 
mononucleotides or preferably a mixture of trinucleotides (Vimekas et al., 1994) 
during automated oligonucleotide synthesis, by error-prone PCR or by other 
methods well known to the practitioner in the art. The random sequences may be 
completely randomized or biased towards or against certain codons according to 
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the amino acid distribution at certain positions in known protein sequences. 
Additionally, the collection of random sub-sequences may comprise different 
numbers of codons, giving rise to a collection of sub-elements having different 
lengths. 

In another embodiment, the invention provides for the expression of the nucleic acid 
sequences from a suitable vector and under suitable conditions well known to those 
skilled in the art. 

In a further preferred embodiment, the (poly)peptides expressed from said nucleic 
acid sequences are screened and, optionally, optimized. Screening may be 
performed by using one of the methods well known to the practitioner in the art, such 
as phage-display, selectively infective phage, polysome technology to screen for 
binding, assay systems for enzymatic activity or protein stability. (Poly)peptides 
having the desired property can be identified by sequencing of the corresponding 
nucleic acid sequence or by amino acid sequencing or mass spectrometry. In the 
case of subsequent optimization, the nucleic acid sequences encoding the initially 
selected (poly)peptides can optionally be used without sequencing. Optimization is 
performed by repeating the replacement of sub-sequences by different sequences, 
preferably by random sequences, and the screening step one or more times. 

The desired property the (poly)peptides are screened for is preferably, but not 
exclusively, selected from the group of optimized affinity or specificity for a target 
molecule, optimized enzymatic activity, optimized expression yields, optimized 
stability and optimized solubility. 

In one embodiment, the cleavage sites flanking the sub-sequences are sites 
recognized and cleaved by restriction enzymes, with recognition and cleavage 
sequences being either identical or different, the restricted sites either having blunt 
or sticky ends. 

The length of the sub-elements is preferably, but not exclusively ranging between 1 
amino acid, such as one residue in the active site of an enzyme or a structure- 
determining residue, and 150 amino acids, as for whole protein domains. Most 
preferably, the length ranges between 3 and 25 amino acids, such as most 
commonly found in CDR loops of antibodies. 

The nucleic acid sequences could be RNA or. preferably, DNA. 
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In one embodiment, the (poly)peptides have an amino acid pattern characteristic of 
a particular species. This can for example be achieved by deducing the consensus 
sequences from a collection of homologous proteins of just one species, most 
preferably from a collection of human proteins. Since the (poly)peptides comprising 
consensus sequences are artificial, they have to be compared to the protein 
sequence(s) having the closest similarity to ensure the presence of said 
characteristic amino acid pattern. 

In one embodiment, the invention provides for the creation of libraries of 
(poly)peptides comprising at least part of members or derivatives of the 
immunoglobulin superfamily, preferably of member or derivatives of the 
immnoglobulins. Most preferably, the invention provides for the creation of libraries 
of human antibodies, wherein said (poly)peptides are or are derived from heavy or 
light chain variable regions wherein said structural sub-elements are framework 
regions (FR) 1, 2, 3, or 4 or complementary determining regions (CDR) 1, 2, or 3. In a 
first step, a database of published antibody sequences of human origin is 
established where the antibody sequences are aligned to each other. The database 
is used to define subgroups of antibody sequences which show a high degree of 
similarity in both the sequence and the canonical fold of CDR loops (as determined 
by analysis of antibody structures). For each of the subgroups a consensus 
sequence is deduced which represents the members of this subgroup; the complete 
collection of consensus sequences represent therefore the complete structural 
repertoire of human antibodies. 

These artificial genes are then constructed e.g. by total gene synthesis or by the use 
of synthetic genetic subunits. These. genetic subunits correspond to structural sub- 
elements on the (poly)peptide level. On the DNA level, these genetic subunits are 
defined by cleavage sites at the start and the end of each of the sub-elements, which 
are unique in the vector system. All genes which are members of the collection of 
consensus sequences are constructed such that they contain a similar pattern of 
corresponding genetic sub-sequences. Most preferably, said (poly)peptides are or 
are derived from the HuCAL consensus genes: Vk1, Vk2, Vk3 Vk4 VX1 V>.2 V? 3 
VH1A, VH1B, VH2, VH3, VH4. VH5, VH6, Ck. CX, CH1 or any combination of said 
HuCAL consensus genes. 

This collection of DNA molecules can then be used to create libraries of antibodies 
or antibody fragments, preferably Fv, disulphide-linked Fv, single-chain Fv (scFv), or 
Fab fragments, which may be used as sources of specificities against new target 
antigens. Moreover, the affinity of the antibodies can be optimized using pre-built 
library cassettes and a general procedure. The invention provides a method for 
.dentifying one or more genes encoding one or more antibody fragments which 
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binds to a target, comprising the steps of axpressing the antibody fragments, and 
then screening them to isolate one or more antibody fragments which bind to a 
given target molecule. Preferably, an scFv fragment library comprising the 
combination of HuCAL VH3 and HuCAL V^2 consensus genes and at least a 
random sub-sequence encoding the heavy chain CDR3 sub-element is screened for 
binding antibodies. If necessary, the modular design of the genes can then be used 
to excise from the genes encoding the antibody fragments one or more genetic sub- 
sequences encoding structural sub-elements, and replacing them by one or more 
second sub-sequences encoding structural sub-elements. The expression and 
screening steps can then be repeated until an antibody having the desired affinity is 
generated. 

Particularly preferred is a method in which one or more of the genetic subunits (e.g. 
the CDRs) are replaced by a random collection of sequences (the library) using the 
said cleavage sites. Since these cleavage sites are (i) unique in the vector system 
and (ii) common to all consensus genes, the same (pre-built) library can be inserted 
into all artificial antibody genes. The resulting library is then screened against any 
chosen antigen. Binding antibodies are selected, collected and used as starting 
material for the next library. Here, one or more of the remaining genetic subunits are 
randomized as described above. 

A further embodiment of the present invention relates to fusion proteins by providing 
for a DNA sequence which encodes both the (poly)peptide, as described above, as 
well as an additional moiety. Particularly preferred are moieties which have a useful 
therapeutic function. For example, the additional moiety may be a toxin molecule 
which is able to kill cells (Vitetta et al., 1993). There are numerous examples of such 
toxins, well known to the one skilled in the art, such as the bacterial toxins 
Pseudomonas exotoxin A, and diphtheria toxin, as well as the plant toxins ricin, 
abrin, modeccin, saporin, and gelonin; By fusing such a toxin for example to an 
antibody fragment, the toxin can be targeted to, for example, diseased cells, and 
thereby have a beneficial therapeutic effect. Alternatively, the additional moiety may 
be a cytokine, such as IL-2 (Rosenberg & Lotze, 1986), which has a particular effect 
(in this case a T-cell proliferative effect) on a family of cells. In a further embodiment, 
the additional moiety may confer on its (poly)peptide partner a means of detection 
and/or purification. For example, the fusion protein could comprise the modified 
antibody fragment and an enzyme commonly used for detection purposes, such as 
alkaline phosphatase (Blake et al., 1984). There are numerous other moieties which 
can be used as detection or purification tags, which are well known to the 
practitioner skilled in the art. Particularly preferred are peptides comprising at least 
five histidine residues (Hochuli et al., 1988), which are able to bind to metal ions, 
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and can therefore be used for the purification cf ths protein to which they are fused 
(Lindner et al., 1 992). Also provided for by the invention are additional moieties such 
as the commonly used C-myc and FLAG tags (Hopp et al., 1988; Knappik & 
Pluckthun, 1994). 

By engineering one or more fused additional domains, antibody fragments or any 
other (poly)peptide can be assembled into larger molecules which also fall under 
the scope of the present invention. For example, mini-antibodies (Pack, 1994) are 
dimers comprising two antibody fragments, each fused to a self-associating 
dimerization domain. Dimerization domains which are particularly preferred include 
those derived from a leucine zipper (Pack & Pluckthun, 1 992) or helix-turn-helix 
motif (Pack et al., 1 993). - 

All of the above embodiments of the present invention can be effected using 
standard techniques of molecular biology known to anyone skilled in the art. 

In a further embodiment, the random collection of sub-sequences (the library) is 
inserted into a singular nucleic acid sequence encoding one (poly)peptide, thus 
creating a (poly)peptide library based on one universal framework. Preferably a 
random collection of CDR sub-sequences is inserted into a universal antibody 
framework, for example into the HuCAL H3k2 single-chain Fv fragment described 
above. 



In further embodiments, the invention provides for nucleic acid sequence(s), 
vector(s) containing the nucleic acid sequence(s), host cell(s) containing the 
vector(s), and (poly)peptides, obtainable according to the methods described above. 

In a further preferred embodiment, the invention provides for modular vector systems 
being compatible with the modular nucleic acid sequences encoding the 
(poly)peptides. The modules of the vectors are flanked by restriction sites unique 
within the vector system and essentially unique with respect to the restriction sites 
incorporated into the nucleic acid sequences encoding the (poly)peptides. except 
for example the restriction sites necessary for cloning the nucleic acid sequences 
into the vector. The list of vector modules comprises origins of single-stranded 
replication, origins of double-stranded replication for high- and low copy number 
plasmids, promotor/operator, repressor or terminator elements, resistance genes, 
potential recombination sites, gene III for display on filamentous phages, signal 
sequences, purification and detection tags, and sequences of additional moieties. 
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The vectors are preferably, but not exclusively, expression vectors or vectors 
suitable for expression and screening of libraries. 



In another embodiment, the invention provides for a kit, comprising one or more of 
the list of nucleic acid sequence(s), recombinant vector(s), (poly)peptide(s), and 
vector(s) according to the methods described above, and suitable host cell(s) for 
producing the (poly)peptide(s). 

In a preferred embodiment, the invention provides for the creation of libraries of 
human antibodies. In a first step, a database of published antibody sequences of 
human origin is established: The database is used to define subgroups of antibody 
sequences which show a high degree of similarity in both the sequence and the 
canonical fold (as determined by analysis of antibody structures). For each of the 
subgroups a consensus sequence is deduced which represents the members of this 
subgroup; the complete collection of consensus sequences represent therefore the 
complete structural repertoire of human antibodies. 

These artificial genes are then constructed by the use of synthetic genetic subunits. 
These genetic subunits correspond to structural sub-elements on the protein level. 
On the DNA level, these genetic subunits are defined by cleavage sites at the start 
and the end of each of the subelements, which are unique in the vector system. All 
genes which are members of the collection of consensus sequences are 
constructed such that they contain a similar pattern of said genetic subunits. 

This collection of DNA molecules can then be used to create libraries of antibodies 
which may be used as sources of specificities against new target antigens. 
Moreover, the affinity of the antibodies can be optimised using pre-built library 
cassettes and a general procedure. The invention provides a method for identifying 
one or more genes encoding one or more antibody fragments which binds to a 
target, comprising the steps of expressing the antibody fragments, and then 
screening them to isolate one or more antibody fragments which bind to a given 
target molecule. If necessary, the modular design of the genes can then be used to 
excise from the genes encoding the antibody fragments one or more genetic sub- 
sequences encoding structural sub-elements, and replacing them by one or more 
second sub-sequences encoding structural sub-elements. The expression and 
screening steps can then be repeated until an antibody having the desired affinity is 
generated. 
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Particularly preferred is a method in which one or rvo r e of the genetic subunits (e.g. 
the CDR's) are replaced by a random collection of sequences (the library) using the 
said cleavage sites. Since these cleavage sites are (i) unique in the vector system 
and (ii) common to all consensus genes, the same (pre-built) library can be inserted 
into all artificial antibody genes. The resulting library is then screened against any 
chosen antigen. Binding antibodies are eluted, collected and used as starting 
material for the next library. Here, one or more of the remaining genetic subunits are 
randomised as described above. 



-12- 

SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 



PCT/EP96/03647 



Definitions 

Protein: 

The term protein comprises monomeric polypeptide chains as well as homo- or 
heteromultimeric complexes of two or more polypeptide chains connected either by 
covalent interactions (such as disulphide bonds) or by non-covalent interactions 
(such as hydrophobic or electrostatic interactions). 

Analysis of homologous proteins: 

The amino acid sequences of three or more proteins are aligned to each other 
(allowing for introduction of gaps) in a way which maximizes the correspondence 
between identical or similar amino acid residues at all positions. These aligned 
sequences are termed homologous if the percentage of the sum of identical and/or 
similar residues exceeds a defined threshold. This threshold is commonly regarded 
by those skilled in the art as being exceeded when at least 15% of the amino acids 
in the aligned genes are identical, and at least 30% are similar. Examples for 
families of homologous proteins are: immunoglobulin superfamily, scavenger 
receptor superfamily, fibronectin superfamilies (e.g. type II and III), complement 
control protein superfamily, cytokine receptor superfamily, cystine knot proteins, 
tyrosine kinases, and numerous other examples well known to one of ordinary skill 
in the art. 

Consensus sequence: 

Using a matrix of at least three aligned amino acid sequences, and allowing for 
gaps in the alignment, it is possible to determine the most frequent amino acid 
residue at each position. The consensus sequence is that sequence which 
comprises the amino acids which are most frequently represented at each position. 
In the event that two or more amino acids are equally represented at a single 
position, the consensus sequence includes both or all of those amino acids. 

Removing unfavorable interactions: 

The consensus sequence is per se in most cases artificial and has to be analyzed in 
order to change amino acid residues which, for example, would prevent the 
resulting molecule to adapt a functional tertiary structure or which would block the 
interaction with other (poly)peptide chains in multimeric complexes. This can be 
done either by (i) building a three-dimensional model of the consensus sequence 
using known related structures as a template, and identifying amino acid residues 
within the model which may interact unfavorably with each other, or (ii) analyzing the 
matrix of aligned amino acid sequences in order to detect combinations of amino 
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acid residues within the sequences which frequently occur together in one 
sequence and are therefore likely to interact with each other. These probable 
interaction-pairs are then tabulated and the consensus is compared with these 
"interaction maps". Missing or wrong interactions in the consensus are repaired 
accordingly by introducing appropriate changes in amino acids which minimize 
unfavorable interactions. 

Identification of structu ral sub-elements: 

Structural sub-elements are stretches of amino acid residues within a 
protein/(poly)peptide which correspond to a defined structural or functional part of 
the molecule. These can be loops (e.g. CDR loops of an antibody) or any other 
secondary or functional structure within the protein/(poly)peptide (domains, a- 
helices, (3-sheets, framework regions of antibodies, etc.). A structural sub-element 
can be identified using known structures of similar or homologous (poly)peptides, or 
by using the above mentioned matrices of aligned amino acid sequences. Here the 
variability at each position is the basis for determining stretches of amino acid 
residues which belong to a structural sub-element (e.g. hypervariable regions of an 
antibody). 

Sub-sequence: 

A sub-sequence is defined as a genetic module which is flanked by unique 
cleavage sites and encodes at least one structural sub-element. It is not necessarily 
identical to a structural sub-element. 

Cleavage site: 

A short DNA sequence which is used as a specific target for a reagent which 
cleaves DNA in a sequence-specific manner (e.g. restriction endonucleases). 

Compatible cleavage sites: 

Cleavage sites are compatible with each other, if they can be efficiently ligated 
without modification and, preferably, also without adding an adapter molecule.. 

Unique cleavage sites: 

A cleavage site is defined as unique if it occurs only once in a vector containing at 
least one of the genes of interest, or if a vector containing at least one of the genes 
of interest could be treated in a way that only one of the cleavage sites could be 
used by the cleaving agent. 
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Corresponding (polypeptide sequences: 

Sequences deduced from the same part of one group of homologous proteins are 
called corresponding (poly)peptide sequences. 

Common cleavage sites: 

A cleavage site in at least two corresponding sequences, which occurs at the same 
functional position (i.e. which flanks a defined sub-sequence), which can be 
hydrolyzed by the same cleavage tool and which yields identical compatible ends is 
termed a common cleavage site. 

Fxcisino genetic sub-seauences: 

A method which uses the unique cleavage sites and the corresponding cleavage 
reagents to cleave the target DNA at the specified positions in order to isolate, 
remove or replace the genetic sub-sequence flanked by these unique cleavage 
sites. 

Exchanging genetic sub-seouences: 

A method by which an existing sub-sequence is removed using the flanking 
cleavage sites of this sub-sequence, and a new sub-sequence or a collection of 
sub-sequences, which contain ends compatible with the cleavage sites thus 
created, is inserted. 

Expression of oenes: 

The term expression refers to in vivo or in vitro processes, by which the information 
of a gene is transcribed into mRNA and then translated into a protein/(poly)peptide. 
Thus, the term expression refers to a process which occurs inside cells, by which the 
information of a gene is transcribed into mRNA and then into a protein. The term 
expression also includes all events of post-translational modification and transport, 
which are necessary for the (poly)peptide to be functional. 

Screening of protein/fpolyVpeptide libraries: 

Any method which allows isolation of one or more proteins/(poly)peptides having a 
desired property from other proteins/(poly)peptides within a library. 

Amino acid pattern characteristic for a species: 

A (poly)peptide sequence is assumed to exhibit an amino acid pattern characteristic 
for a species if it is deduced from a collection of homologous proteins from just this 
species. 
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Immunoglobulin superfamily (IgSR: 

The IgSF is a family of proteins comprising domains being characterized by the 
immunoglobulin fold. The IgSF comprises for example T-cell receptors and the 
immunoglobulins (antibodies). 

Antibody framework: 

A framework of an antibody variable domain is defined by Kabat et al. (1991) as the 
part of the variable domain which serves as a scaffold for the antigen binding loops 
of this variable domain. 

Antibody CDR: 

The CDRs (complementarity determining regions) of an antibody consist of the 
antigen binding loops, as defined by Kabat et al. (1991). Each of the two variable 
domains of an antibody Fv fragment contain three CDRs. 

HuCAL: 

Acronym for Human Combinatorial Antibody Library. Antibody Library based on 
modular consensus genes according to the invention (see Example 1). 

Antibody fragment: 

Any portion of an antibody which has a particular function, e.g. binding of antigen. 
Usually, antibody fragments are smaller than whole antibodies. Examples are Fv, 
disulphide-linked Fv, single-chain Fv (scFv), or Fab fragments. Additionally, antibody 
fragments are often engineered to include new functions or properties. 

Universal framework: 

One single framework which can be used to create the full variability of functions, 
specificities or properties which is originally sustained by a large collection of 
different frameworks, is called universal framework. 

Binding of an antibody to its target- 

The process which leads to a tight and specific association between an antibody 
and a corresponding molecule or ligand is called binding. A molecule or ligand or 
any part of a molecukle or ligand which is recognized by an antibody is called the 
target. 

Replacing genetic sub- sequences 

A method by which an existing sub-sequence is removed using the flanking 
cleavage sites of this sub-sequence, and a new sub-sequence or collection of sub- 
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sequences, which contains ends compatible with the clsavag? s ; t?s thus created, is 
inserted. 



Assembling of genetic sequences: 

Any process which is used to combine synthetic or natural genetic sequences in a 
specific manner in order to get longer genetic sequences which contain at least 
parts of the used synthetic or natural genetic sequences. 

Analysis of homologous genes: 

The corresponding amino acid sequences of two or more genes are aligned to each 
other in a way which maximizes the correspondence between identical or similar 
amino acid residues at all positions. These aligned sequences are termed 
homologous if the percentage of the sum of identical and/or similar residues 
exceeds a defined threshold. This threshold is commonly regarded by those skilled 
in the art as being exceeded when at least 15 per cent of the amino acids in the 
aligned genes are identical, and at least 30 per cent are similar. 
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Fig. 1 : Flow chart outlining the process of construction of a synthetic human 

antibody library based on consensus sequences. 
Fig. 2: Alignment of consensus sequences designed for each subgroup (amino 

acid residues are shown with their standard one-letter abbreviation). (A) 

kappa sequences, (B) lambda sequences and (C), heavy chain 

sequences. The positions are numbered according to Kabat (1991). In 

order to maximize homology in the alignment, gaps (— ) have been 

introduced in the sequence at certain positions. 
Fig. 3: Gene sequences of the synthetic V kappa consensus genes. The 

corresponding amino acid sequences (see Fig. 2) as well as the unique 

cleavage sites are also shown. 
Fig. 4: Gene sequences of the synthetic V lambda consensus genes. The 

corresponding amino acid sequences (see Fig. 2) as well as the unique 

cleavage sites are also shown. 
Fig. 5: Gene sequences of the synthetic V heavy chain consensus genes. The 

corresponding amino acid sequences (see Fig. 2) as well as the unique 

cleavage sites are also shown. 
Fig. 6: Oligonucleotides used for construction of the consensus genes. The 

oligos are named according to the corresponding consensus gene, e.g. 

the gene Vk1 was constructed using the six oligonucleotides 01 K1 to 

01 K6. The oligonucleotides used for synthesizing the genes encoding 

the constant domains Ck (OCLK1 to 8) and CH1 (OCH1 to 8) are also 

shown. 

Fig. 7A/B: Sequences of the synthetic genes encoding the constant domains Ck 
(A) and CH1 (B). The corresponding amino acid sequences as well as 
unique cleavage sites introduced in these genes are also shown. 

Fig. 7C: Functional map and sequence of module M24 comprising the synthetic 
Ck gene segment (huCL lambda). 

Fig. 7D: Oligonucleotides used for synthesis of module M24. 

Fig. 8: Sequence and restriction map of the synthetic gene encoding the 
consensus single-chain fragment VH3-Vk2, The signal sequence (amino 
acids 1 to 21) was derived from the E. coli phoA gene (Skerra & 
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Pluckthun, 1988). Between the phoA signal sequence 2rd the VH3 
domain, a short sequence stretch encoding 4 amino acid residues (amino 
acid 22 to 25) has been inserted in order to allow detection of the single- 
chain fragment in Western blot or ELISA using the monoclonal antibody 
M1 (Knappik & Pluckthun, 1994). The last 6 basepairs of the sequence 

were introduced for cloning purposes (EcoRI site). 
Fig. 9: Plasmid map of the vector plG10.3 used for phage display of the H3k2 
scFv fragment. The vector is derived from plG10 and contains the gene for 
the lac operon repressor, lacl, the artificial operon encoding the H3k2- 
gene3ss fusion under control of the lac promoter, the Ipp terminator of 
transcription, the single-strand replication' origin of the E. coli phage f1 
(F1_ORI), a gene encoding p-lactamase (bla) and the ColEI derived 
origin of replication. 

Fig. 10: Sequencing results of independent clones from the initial library, 
translated into the corresponding amino acid sequences. (A) Amino acid 
sequence of the VH3 consensus heavy chain CDR3 (position 93 to 102, 
Kabat numbering). (B) Amino acid sequences of 12 clones of the 10-mer 
library. (C) Amino acid sequences of 11 clones of the 15-mer library, 4 : 
single base deletion. 

Fig. 11: Expression test of individual library members. (A) Expression of 9 
independent clones of the 10-mer library. (B) Expression of 9 
independent clones of the 15-mer library. The lane designated with M 
contains the size marker. Both the gp3-scFv fusion and the scFv monomer 
are indicated. 

Fig. 12: Enrichment of specific phage antibodies during the panning against FITC- 
BSA. The initial as well as the subsequent fluorescein-specific sub- 
libraries were panned against the blocking buffer and the ratio of the 
phage eluted from the FITC-BSA coated well vs. that from the powder milk 
coated well from each panning round is presented as the „specificity 
factor". 

Fig. 13: Phage ELISA of 24 independent clones after the third round of panning 

tested for binding on FITC-BSA. 
Fig. 14: Competition ELISA of selected FITC-BSA binding clones. The ELISA 

signals (OD 405nm ) of scFv binding without inhibition are taken as 100%. 
Fig. 15: Sequencing results of the heavy chain CDR3s of independent clones 

after 3 rounds of panning against FITC-BSA, translated into the 

corresponding amino acid sequences (position 93 to 102, Kabat 

numbering). 
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Fig. 16: Coomassie-Blue stained SDS-PAGE of the purified anti-fluorescein scFv 
fragments: M: molecular weight marker, A: total soluble cell extract after 
induction, B: fraction of the flow-through, C, D and E: purified scFv 
fragments 1HA-3E4, 1HA-3E5 and 1HA-3E10, respectively. 

Fig. 17: Enrichment of specific phage antibodies during the panning against B- 
estradiol-BSA, testosterone-BSA, BSA, ESL-1 , interleukin-2, 
lymphotoxin-B t and LeY-BSA after three rounds of panning. 

Fig. 18: ELiSA of selected ESL-1 and 8-estradiol binding clones 

Fig- 19: Selectivity and cross-reactivity of HuCAL antibodies: in the diagonal 
specific binding of HuCAL antibodies can be seen, off-diagonal signals 
show non-specific cross-reactivity. 

Fig. 20: Sequencing results of the heavy chain CDR3s of independent clones 
after 3 rounds of panning against B-estradioI-BSA, translated into the 
corresponding amino acid sequences (position 93 to 102, Kabat 
numbering). One clone is derived from the 10mer library. 

Fig. 21: Sequencing results of the heavy chain CDR3s of independent clones 
after 3 rounds of panning against testosterone-BSA, translated into the 
corresponding amino acid sequences (position 93 to 102, Kabat 
numbering). 

Fig. 22: Sequencing results of the heavy chain CDR3s of independent clones 
after 3 rounds of panning against lyrnphotoxin-8 t translated into the 
corresponding amino acid sequences (position 93 to 102, Kabat 
numbering). One clone comprises a 14mer CDR, presumably introduced 
by incomplete coupling of the trinucleotide mixture during oligonucleotide 
synthesis. 

Fig. 23: Sequencing results of the heavy chain CDR3s of independent clones 
after 3 rounds of panning against ESL-1, translated into the 
corresponding amino acid sequences (position 93 to 102, Kabat 
numbering). Two clones are derived from the 10mer library. One clone 
comprises a 16mer CDR, presumably introduced by chain elongation 
during oligonucleotide synthesis using trinucleotides. 

Fig. 24: Sequencing results of the heavy chain CDR3s of independent clones 
after 3 rounds of panning against BSA, translated into the corresponding 
amino acid sequences (position 93 to 102, Kabat numbering). 

Fig. 25: Schematic representation of the modular pCAL vector system. 

Fig. 25a: List of restriction sites already used in or suitable for the modular HuCAL 
genes and pCAL vector system. 

Fig. 26: List of the modular vector elements for the pCAL vector series: shown are 
only those restriction sites which are part of the modular system. 
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Fig. 27: Functional map and sequence of the multi-cloning site module ^/CS) 
Fig. 28: Functional map and sequence of the pMCS cloning vector series. 
Fig. 29: Functional map and sequence of the pCAL module M1 (see Fig. 26). 
Fig. 30: Functional map and sequence of the pCAL module M7-W (see Fig. 26). 
Fig. 31 : Functional map and sequence of the pCAL module M9-II (see Fig. 26). 
Fig. 32: Functional map and sequence of the pCAL module M1 1-11 (see Fig. 26). 
Fig. 33: Functional map and sequence of the pCAL module M14-Ext2 (see Fig. 
26). 

Fig. 34: Functional map and sequence of the pCAL module M17 (see Fig. 26). 

Fig. 35: Functional map and sequence of the modular vector pCAL4. 

Fig. 35a: Functional maps and sequences of additional pCAL modules (M2, M3, 
M7I, M7II, M8, M10II, M11II, M12, M13, M19, M20, M21, M41) and of low- 
copy number plasmid vectors (pCALOl to pCAL03). 

Fig. 35b: List of oligonucleotides and primers used for synthesis of pCAL vector 
modules. 

Fig. 36: Functional map and sequence of the B-lactamase cassette for 
replacement of CDRs for CDR library cloning. 

Fig. 37: Oligo and primer design for Vk* CDR3 libraries 

Fig. 38: Oligo and primer design for CDR3 libraries 

Fig. 39: Functional map of the pBS13 expression vector series. 

Fig. 40: Expression of all 49 HuCAL scFvs obtained by combining each of the 7 
VH genes with each of the 7 VL genes (pBS13, 30°C): Values are given 
for the percentage of soluble vs. insoluble material, the total and the 
soluble amount compared to the combination H3k2, which was set to 
100%. In addition, the corresponding values for the McPC603 scFv are 
given. 

Table 1 : Summary of human immunoglobulin germline sequences used for 
computing the germline membership of rearranged sequences. (A) kappa 
sequences, (B) lambda sequences and (C), heavy chain sequences. (1) 
The germline name used in the various calculations, (2) the references 
number for the corresponding sequence (see appendix for sequence 
related citations), (3) the family where each sequence belongs to and (4), 
the various names found in literature for germline genes with identical 
amino acid sequences. 

Table 2: Rearranged human sequences used for the calculation of consensus 
sequences. (A) kappa sequences, (B) lambda sequences and (C). heavy 
chain sequences. The table summarized the name of the sequence (1), 
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the length of the sequence in amino acids (2), th2 gsrmlin'? family (3) as 
well as the computed germline counterpart (4). The number of amino acid 
exchanges between the rearranged sequence and the germline 
sequence is tabulated in (5), and the percentage of different amino acids 
is given in (6). Column (7) gives the references number for the 
corresponding sequence (see appendix for sequence related citations). 
Table 3: Assignment of rearranged V sequences to their germline counterparts. 

(A) kappa sequences, (B) lambda sequences and (C), heavy chain 
sequences. The germline genes are tabulated according to their family 
(1), and the number of rearranged genes found for every germline gene is 
given in (2). 

Table 4: Computation of the consensus sequence of the rearranged V kappa 
sequences. (A), V kappa subgroup 1, (B), V kappa subgroup 2, (C), V 
kappa subgroup 3 and (D), V kappa subgroup 4. The number of each 
amino acid found at each position is tabulated together with the statistical 
analysis of the data. (1) Amino acids are given with their standard one- 
letter abbreviations (and B means D or N, Z means E or Q and X means 
any amino acid). The statistical analysis summarizes the number of 
sequences found at each position (2), the number of occurrences of the 
most common amino acid (3), the amino acid residue which is most 
common at this position (4), the relative frequency of the occurrence of the 
most common amino acid (5) and the number of different amino acids 
found at each position (6). * 

Table 5: Computation of the consensus sequence of the rearranged V lambda 
sequences. (A), V lambda subgroup 1, (B), V lambda subgroup 2, and 
(C), V lambda subgroup 3. The number of each amino acid found at each 
position is tabulated together with the statistical analysis of the data. 
Abbreviations are the same as in Table 4. 

Table 6: Computation of the consensus sequence of the rearranged V heavy chain 
sequences. (A), V heavy chain subgroup 1A, (B), V heavy chain 
subgroup 1B, (C), V heavy chain subgroup 2, (D), V heavy chain 
subgroup 3, (E), V heavy chain subgroup 4, (F), V heavy chain subgroup 
5, and (G), V heavy chain subgroup 6. The number of each amino acid 
found at each position is tabulated together with the statistical analysis of 
the data. Abbreviations are the same as in Table 4. 
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Examples 

Example 1: Design of a Synthetic Human Combinatorial Antibody 
Library (HuCAL) 

The following example describes the design of a fully synthetic human combinatorial 
antibody library (HuCAL), based on consensus sequences of the human 
immunoglobulin repertoire, and the synthesis of the consensus genes. The general 
procedure is outlined in Fig. 1 . 

1.1 Sequence database 

1.1.1 Collection and alignment of human immunoglobulin sequences 

In a first step, sequences of variable domains of human immunoglobulins have been 
collected and divided into three sub bases: V heavy chain (VH), V kappa (Vk) and V 
lambda (Va). For each sequence, the gene sequence was then translated into the 
corresponding amino acid sequence. Subsequently, all amino acid sequences were 
aligned according to Kabat et al. (1991). In the case of \fX sequences, the 
numbering system of Chuchana et al. (1990) was used. Each of the three main 
databases was then divided into two further sub bases: the first sub base contained 
all sequences derived from rearranged V genes, where more than 70 positions of 
the sequence were known. The second sub base contained all germline gene 
segments (without the D- and J- minigenes; pseudogenes with internal stop codons 
were also removed). In all cases, where germline sequences with identical amino 
acid sequence but different names were found, only one sequence was used (see 
Table 1). The final databases of rearranged sequences contained 386, 149 and 
674 entries for Vk, Vk and VH, respectively. The final databases of germline 
sequences contained 48, 26 and 141 entries for Vk, VA, and VH, respectively. 

1.1.2 Assignment of sequences to subgroups 

The sequences in the three germline databases where then grouped according to 
sequence homology (see also Tomlinson et al., 1992, Williams & Winter, 1993, and 
Cox et a!., 1994). In the case of Vk, 7 families could be established. VK was divided 
into 8 families and VH into 6 families. The VH germline genes of the VH7 family (Van 
Dijk et al. t 1993) were grouped into the VH1 family, since the genes of the two 
families are highly homologous. Each family contained different numbers of 
germline genes, varying from 1 (for example VH6) to 47 (VH3). 
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1.2 Analysis of sequences 

1.2.1 Computation of germline membership 

For each of the 1209 amino acid sequences in the databases of rearranged genes, 
the nearest germline counterpart, i.e. the germline sequence with the smallest 
number of amino acid differences was then calculated. After the germline 
counterpart was found, the number of somatic mutations which occurred in the 
rearranged gene and which led to amino acid exchanges could be tabulated. In 140 
cases, the germline counterpart could not be calculated exactly, because more than 
one germline gene was found with an identical number of amino acid exchanges. 
These rearranged sequences were removed from the database. In a few cases, the 
number of amino acid exchanges was found to be unusually large (>20 for VL and 
>25 for VH), indicating either heavily mutated rearranged genes or derivation from 
germline genes not present in the database. Since it was not possible to distinguish 
between these two possibilities, these sequences were also removed from the 
database. Finally, 12 rearranged sequences were removed from the database 
because they were found to have very unusual CDR lengths and composition or 
unusual amino acids at canonical positions (see below). In summary, 1023 
rearranged sequences out of 1209 (85%) could be clearly assigned to their 
germline counterparts (see Table 2). 

After this calculation, every rearranged gene could be arranged in one of the 
families established for the germline genes. Now the usage of each germline gene, 
i.e. the number of rearranged genes which originate from each germline gene, could 
be calculated (see Table 2). It was found that the usage was strongly biased towards 
a subset of germline genes, whereas most of the germline genes were not present 
as rearranged genes in the database and therefore apparently not used in the 
immune system (Table 3), This observation had already been reported in the case of 
Vk (Cox, et al., 1994). All germline gene families, where no or only very few 
rearranged counterparts could be assigned, were removed from the database, 
leaving 4 Vk, 3 V?v, and 6 VH families. 

1.2.2 Analysis of CDR conformations 

The conformation of the antigen binding loops of antibody molecules, the CDRs, is 
strongly dependent on both the length of the CDRs and the amino acid residues 
located at the so-called canonical positions (Chothia & Lesk, 1987). It has been 
found that only a few canonical structures exist, which determine the structural 
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repertoire of the immunoglobulin variable domains (Chothia et al., 1939). The 
canonical amino acid positions can be found in CDR as well as framework regions. 
The 13 used germiine families defined above (7 VL and 6 VH) were now analyzed 
for their canonical structures in order to define the structural repertoire encoded in 
these families. 

In 3 of the 4 Vk families (Vk1, 2 and 4), one different type of CDR1 conformation 
could be defined for every family. The family Vk3 showed two types of CDR1 
conformation: one type which was identical to Vk1 and one type only found in Vk3. 
All Vk CDR2s used the same type of canonical structure. The CDR3 conformation is 
not encoded in the germiine gene segments. Therefore, the 4 Vk families defined by 
sequence homology and usage corresponded also to 4 types of canonical 
structures found in Vk germiine genes. 

The 3 V\ families defined above showed 3 types of CDR1 conformation, each family 
with one unique type. The VX1 family contained 2 different CDR1 lengths (13 and 14 
amino acids), but identical canonical residues, and it is thought that both lengths 
adopt the same canonical conformation (Chothia & Lesk, 1987). In the CDR2 of the 
used Vk germlines, only one canonical conformation exists, and the CDR3 
conformation is not encoded in the germiine gene segments. Therefore, the 3 VX 
families defined by sequence homology and usage corresponded also to 3 types of 
canonical structures. 

The structural repertoire of the human VH sequences was analyzed in detail by 
Chothia et al., 1992. In total, 3 conformations of CDR1 (HM, H1-2 and H1-3) and 6 
conformations of CDR2 (H2-1, H2-2, H2-3, H2-4, H2-5 and H2-x) could be defined. 
Since the CDR3 is encoded in the D- and J-minigene segments, no particular 
canonical residues are defined for this CDR. 

All the members of the VH1 family defined above contained the CDR1 conformation 
H1-1, but differed in their CDR2 conformation: the H2-2 conformation was found in 6 
germiine genes, whereas the conformation H2-3 was found in 8 germiine genes. 
Since the two types of CDR2 conformations are defined by different types of amino 
acid at the framework position 72, the VH1 family was divided into two subfamilies: 
VH1A with CDR2 conformation H2-2 and VH1B with the conformation H2-3. The 
members of the VH2 family all had the conformations H1-3 and H2-1 in CDR1 and 
CDR2, respectively. The CDR1 conformation of the VH3 members was found in all 
cases to be H1-1, but 4 different types were found in CDR2 (H2-1, H2-3, H2-4 and 
H2-x). In these CDR2 conformations, the canonical framework residue 71 is always 
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defined by an arginine. Therefore, it was not necessary to divide ihe VH3 family into 
subfamilies, since the 4 types of CDR2 conformations were defined solely by the 
CDR2 itself. The same was true for the VH4 family. Here, all 3 types of CDR1 
conformations were found, but since the CDR1 conformation was defined by the 
CDR itself (the canonical framework residue 26 was found to be glycine in all 
cases), no subdivisions were necessary. The CDR2 conformation of the VH4 
members was found to be H2-1 in all cases. All members of the VH5 family were 
found to have the conformation H1-1 and H2-2, respectively. The single germline 
gene of the VH6 family had the conformations H1-3 and H2-5 in CDR1 and CDR2, 
respectively. 

In summary, all possible CDR conformations of the Vk and Vk genes were present 
in the 7 families defined by sequence comparison. From the 12 different CDR 
conformations found in the used VH germline genes, 7 could be covered by dividing 
the family VH1 into two subfamilies, thereby creating 7 VH families. The remaining 5 
CDR conformations (3 in the VH3 and 2 in the VH4 family) were defined by the 
CDRs themselves and could be created during the construction of CDR libraries. 
Therefore, the structural repertoire of the used human V genes could be covered by 
49 (7 x 7) different frameworks. 

1.2.3 Computation of consensus sequences 

The 14 databases of rearranged sequences (4 Vk, 3 VX and 7 VH) were used to 
compute the HuCAL consensus sequences of each subgroup (4 HuCAL- Vk, 3 
HuCAL- Vk, 7 HuCAL- VH, see Table 4, 5 and 6). This was done by counting the 
number of amino acid residues used at each position (position variability) and 
subsequently identifying the amino acid residue most frequently used at each 
position. By using the rearranged sequences instead of the used germline 
sequences for the calculation of the consensus, the consensus was weighted 
according to the frequency of usage. Additionally, frequently mutated and highly 
conserved positions could, be identified. The consensus sequences were cross- 
checked with the consensus of the germline families to see whether the rearranged 
sequences were biased at certain positions towards amino acid residues which do 
not occur in the collected germline sequences, but this was found not to be the case. 
Subsequently, the number of differences of each of the 14 consensus sequences to 
each of the germline sequences found in each specific family was calculated. The 
overall deviation from the most homologous germline sequence was found to be 2.4 
amino acid residues (s.d. = 2.7), ensuring that the "artificial" consensus sequences 
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can still be considered as truly human sequences as far as imrr.unogcnicity is 
concerned. 



7.3 Structural analysis 

So far, only sequence information was used to design the consensus sequences. 
Since it was possible that during the calculation certain artificial combinations of 
amino acid residues have been created, which are located far away in the sequence 
but have contacts to each other in the three dimensional structure, leading to 
destabilized or even misfolded frameworks, the 14 consensus sequences were 
analyzed according to their structural properties. 

It was rationalized that all rearranged sequences present in the database 
correspond to functional and therefore correctly folded antibody molecules. Hence, 
the most homologous rearranged sequence was calculated for each consensus 
sequence. The positions where the consensus differed from the rearranged 
sequence were identified as potential "artificial residues" and inspected. 
The inspection itself was done in two directions. First, the local sequence stretch 
around , each potentially "artificial residue" was compared with the corresponding 
stretch of all the rearranged sequences. If this stretch was found to be truly artificial, 
i.e. never occurred in any of the rearranged sequences, the critical residue was 
converted into the second most common amino acid found at this position and 
analyzed again. Second, the potentially "artificial residues" were analyzed for their 
long range interactions. This was done by collecting all available structures of 
human antibody variable domains from the corresponding PDB files and calculating 
for every structure the number and type of interactions each amino acid residue 
established to each side-chain. These "interaction maps" were used to analyze the 
probable side-chain/side-chain interactions of the potentially "artificial residues". As 
a result of this analysis, the following residues were exchanged (given is the name 
of the gene, the position according to Kabat's numbering scheme, the amino acid 
found at this position as the most abundant one and the amino acid which was used 
instead): 

VH2: S 65 T 

Vk1: N 3d A, 

Vk3: G 9 A, D 60 A, R^S 

VX3: V 7R T 
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1.4 Design of CDR sequences 

The process described above provided the complete consensus sequences derived 
solely from the databases of rearranged sequences. It was rationalized that the 
CDR1 and CDR2 regions should be taken from the databases of used germline 
sequences, since the CDRs of rearranged and mutated sequences are biased 
towards their particular antigens. Moreover, the germline CDR sequences are 
known to allow binding to a variety of antigens in the primary immune response, 
where only CDR3 is varied. Therefore, the consensus CDRs obtained from the 
calculations described above were replaced by germline CDRs in the case of VH 
and Vk. In the case of VX, a few amino acid exchanges were introduced in some of 
the chosen germline CDRs in order to avoid possible protease cleavage sites as 
well as possible structural constraints. 

The CDRs of following germline genes have been chosen: 



HuCAL gene 

HuCAL-VH1A 

HuCAL-VH1B 

HuCAL-VH2 

HuCAL-VH3 

HuCAL-VH4 

HuCAL-VH5 

HuCAL-VH6 

HuCAL- Vk1 

HuCAL-Vk2 

HuCAL-Vk3 

HuCAL-Vk4 

HuCAL- V?J 

HuCAL-VX2 

HuCAL-V?w3 



CDR1 
VH 1-1 2-1 
VH1-13-16 
VH2-31-10.-1 1,-12,-13 
VH3-1 3-8,-9,-10 
VH4-1i-7to-14 

VH5-12-1.-2 
VH.6-35-1 
Vk1-14,-15 
Vk2-6 
Vk3-1,-4 
Vk4-1 
HUMLV1 17.DPL5 
DPL11.DPL12 
DPL23 



CDR2 
VH1-12-1 
VH1 -13-6,-7,-8,-9 

VH2-31-3.-4 
VH3-1 3-8,-9,-10 
VH4-1 1-8,-9,-11,-12,-14,-16 
VH4-31 -17,-1 8,-1 9,-20 
VH5-12-1.-2 
VH6-35-1 
Vk 1 -2,-3,-4,-5,-7,-8,-1 2,- 1 3,-1 8,-1 9 
Vk2-6 
Vk3-4 
Vk4-1 
DPL5 
DPL12 
HUMLV318 
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In the case of the CDR3s, any sequence could be chosen since these CDRs were 
planned to be the first to be replaced by oligonucleotide libraries. In order to study 
the expression and folding behavior of the consensus sequences in E. coli, it would 
be useful to have all sequences with the same CDR3, since the influence of the 
CDR3s on the folding behavior would then be identical in all cases. The dummy 
sequences QQHYTTPP and ARWGGDGFYAMDY were selected for the VL chains 
(kappa and lambda) and for the VH chains, respectively. These sequences are 
known to be compatible with antibody folding in E. coli (Carter et al., 1992). 



1.5 Gene design 

The final outcome of the process described above was a collection of 14 HuCAL 
amino acid sequences, which represent the frequently used structural antibody 
repertoire of the human immune system (see Figure 2).- These sequences were 
back-translated into DNA sequences. In a first step t the back-translation was done 
using only codons which are known to be frequently used in E. coli These gene 
sequences were then used for creating a database of ail possible restriction 
endonuclease sites, which could be introduced without changing the corresponding 
amino acid sequences. Using this database, cleavage sites were selected which 
were located at the flanking regions of all sub-elements of the genes (CDRs and 
framework regions) and which could be introduced in all HuCAL VH, Vk or V?, 
genes simultaneously at the same position. In a few cases it was not possible to find 
cleavage sites for all genes of a subgroup. When this happened, the amino acid 
sequence was changed, if this was possible according to the available sequence 
and structural information. This exchange was then analyzed again as described 
above. In total, the following 6 amino acid residues were exchanged during this 
design (given is the name of the gene, the position according to Kabat's numbering 
scheme, the amino acid found at this position as the most abundant one and the 
amino acid which was used instead): 
VH2: T 3 Q 
VH6: S, 2 G 
Vk3: E.DJseV 
Vk4: K 24 R 
VX3: T„S 
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In one case (5'-end of VH framework 3) it was not possible to identify a single 
cleavage site for all 7 VH genes. Two different type of cleavage sites were used 
instead: BstEII for HuCAL VH1A, VH1B, VH4 and VH5, and NspV for HuCAL VH2, 
VH3, VH4 and VH6. 

Several restriction endonuclease sites were identified, which were not located at the 
flanking regions of the sub-elements but which could be introduced in every gene of 
a given group without changing the amino acid sequence. These cleavage sites 
were also introduced in order to make the system more flexible for further 
improvements. Finally, all but one remaining restriction endonuclease sites were 
removed in every gene sequence. The single cleavage site, which was not removed 
was different in all genes of a subgroup and could be therefore used as a 
"fingerprint" site to ease the identification of the different genes by restriction digest. 
The designed genes, together with the corresponding amino acid sequences and 
the group-specific restriction endonuclease sites are shown in Figure 3, 4 and 5, 
respectively. 



1.6 Gene synthesis and cloning 

The consensus genes were synthesized using the method described by Prodromou 
& Pearl, 1992, using the oligonucleotides shown in Fig. 6. Gene segments encoding 
the human constant domains Ck, CX and CHI were also synthesized, based on 
sequence information given by Kabat et al., 1991 (see Fig. 6 and Fig. 7). Since for 
both the CDR3 and the framework. 4 gene segments identical sequences were 
chosen in all HuCAL Vk, VX and VH genes, respectively, this part was constructed 
only once, together with the corresponding gene segments encoding the constant 
domains. The PCR products were cloned into pCR-Script KS(+) (Stratagene, Inc.) or 
pZErO-1 (Invitrogen, Inc.) and verified by sequencing. 

Example 2: Cloning and Testing of a HuCAL-Based Antibody Library 

A combination of two of the synthetic consensus genes was chosen after 
construction to test whether binding antibody fragments can be isolated from a 
library based on these two consensus frameworks. The two genes were cloned as a 
single-chain Fv (scFv) fragment, and a VH-CDR3 library was inserted. In order to test 
the library for the presence of functional antibody molecules, a selection procedure 
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was carried out using the small hapten fluorescein bound to 6SA iFlfC-BSA) as 
antigen. 



2.7 Cloning of the HuCAL VH3-Vk2 scFv fragment 

In order to test the design of the consensus genes, one randomly chosen 
combination of synthetic light and heavy gene (HuCAL-Vk2 and HuCAL-VH3) was 
used for the construction of a single-chain antibody (scFv) fragment. Briefly, the 
gene segments encoding the VH3 consensus gene and the CH1 gene segment 
including the CDR3 - framework 4 region, as well -as the Vk2 consensus gene and 
the Ck gene segment including the CDR3 - framework 4 region were assembled 
yielding the gene for the VH3-CH1 Fd fragment and the gene encoding the Vk2-Ck 
light chain, respectively. The CH1 gene segment was then replaced by an 
oligonucleotide cassette encoding a 20-mer peptide linker with the sequence 
AGGGSGGGGSGGGGSGGGGS. The two oligonucleotides encoding this linker 
were 5'- TCAGCGGGTGGCGGTTCTGGCGGCGGTGGGAGCGGTGGCGGTGGTTC- 
TGGCGGTGGTGGTTCCGATATCGGTCCACGTACGG-3 1 and S'-AATTCCGTACG- 
TGGACCGATATCGGAACCACCACCGCCAGAACCACCGCCACCGCTCCCACCGC 
CGCCAGAACCGCCACCCGC-3\ respectively. Finally, the HuCAL-Vk2 gene was 
inserted via EcoRV and BsiWI into the plasmid encoding the HuCAL-VH3-linker 
fusion, leading to the final gene HuCAL-VH3-Vk2, which encoded the two 
consensus sequences in the single-chain format VH-linker-VL. The complete coding 
sequence is shown in Fig. 8. 



2.2 Construction of a monovalent phage-dispiay phagemid vector 
plG10.3 

Phagemid plG10.3 (Fig. 9) was constructed in order to create a phage-display 
system (Winter et al., 1994) for the H3k2 scFv gene. Briefly, the EcoRI/Hindlll 
restriction fragment in the phagemid vector plG10 (Ge et al. t 1995) was replaced by 
the c-myc followed by an amber codon (which encodes an glutamate in the amber- 
suppresser strain XL1 Blue and a stop codon in the non-suppresser strain JM83) 
and a truncated version of the gene III (fusion junction at codon 249, see Lowman et 
al, 1991) through PCR mutagenesis. 
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2.3 Construction of H-CDR3 libraries 

Heavy chain CDR3 libraries of two lengths (10 and 15 amino acids) were 
constructed using trinucleotide codon containing oligonucleotides (Virnekas et al M 
1994) as templates and the oligonucleotides complementing the flanking regions as 
prrmers. To concentrate only on the CDR3 structures that appear most often in 
functional antibodies, we kept the salt-bridge of and D H101 in the CDR3 loop. For 
the 15-mer library, both phenylalanine and methionine were introduced at position 
1 00 since these two residues were found to occur quite often in human CDR3s of 
this length (not shown). For the same reason, valine and tyrosine were introduced at 
position 102. All other randomized positions contained codons for all amino acids 
except cystein, which was not used in the trinucleotide mixture. 
The CDR3 libraries of lengths 10 and 15 were generated from the PCR fragments 
using oligonucleotide templates O3HCDR103T (5'- GATACGGCCGTGTATTA- 
TTGCGCGCGT (TRI^GATTATrGGGGCCAAGGCACCCTG-S 1 ) and 03HCDR153T 
(5-GATACGGCCGT GTATTATTGCGCGCGT(TRI) I0 (TTT/ATG)GAT(GTT/TAT)TGGG- 
GCCAAGGCACCCTG-3'), and primers 03HCDR35 (S'-GATACGGCCGTGTATTA- 
TTGC-3') and 03HCDR33 (5'-CAGGGTGCCTTGGCCCC-3'), where TRI are 
trinucleotide mixtures representing all amino acids without cystein, (TTT/ATG) and 
(GTT/T AT) are trinucleotide mixtures encoding the amino acids 
phenylalanine/methionine and valine/tyrosine, respectively. The potential diversity 
of these libraries was 4.7 x 10 7 and 3.4 x 10 10 for 10-mer and 15-mer library, 
respectively. The library cassettes were first synthesized from PCR amplification of 
the oligo templates in the presence of both primers: 25 pmol of the oligo template 
O3HCDR103T or 03HCDR153T, 50 pmol each of the primers 03HCDR35 and 
03HCDR33, 20 nmol of dNTP, 10x buffer and 2.5 units of Pfu DNA polymerase 
(Stratagene) in a total volume of 100 jil for 30 cycles (1 minute at 92°C, 1 minute at 
62°C and 1 minute at 72°C). A hot-start procedure was used. The resulting mixtures 
were phenol-extracted, ethanol-precipitated and digested overnight with Eagl and 
Sty I. The vector plG10.3-scH3K2cat, where the Eagl-Styl fragment in the vector 
PIG10.3-scH3k2 encoding the H-CDR3 was replaced by the chloramphenicol 
acetyltransferase gene (cat) flanked with these two sites, was similarly digested. The 
digested vector (35 /yg) was gel-purified and iigated with 100 fjg of the library 
cassette overnight at 16°C. The ligation mixtures were isopropanol precipitated, air- 
dried and the pellets were redissoived in 100 ul of ddH20. The ligation was mixed 
with 1 ml of freshly prepared electrocompetent XL1 Blue on ice. 20 rounds of 
electroporation were performed and the transformants were diluted in SOC medium, 
shaken at 37°C for 30 minutes and plated out on large LB plates (Amp/Tet/Glucose) 
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at 37°C for 6-9 hrs. The number of transformants (library size) was 3.2x1 0 7 and 
2.3x1 0 7 for the 10-mer and the 15-mer library, respectively. The colonies were 
suspended in 2xYT medium (Amp/Tet/Glucose) and stored as glycerol culture. 
In order to test the quality of the initial library, phagemids from 24 independent 
colonies (12 from the 10-mer and 12 from the 15-mer library, respectively) were 
isolated and analyzed by restriction digestion and sequencing. The restriction 
analysis of the 24 phagemids indicated the presence of intact vector in all cases. 
Sequence analysis of these clones (see Fig. 10) indicated that 22 out of 24 
contained a functional sequence in their heavy chain CDR3 regions. 1 out of 12 
clones of the 10-mer library had a CDR3 of length 9 instead of 10, and 2 out of 12 
clones of the 15-mer library had no open reading frame, thereby leading to a non- 
functional scFv; one of these two clones contained two consecutive inserts, but out 
of frame (data not shown). All codons introduced were presented in an even 
distribution. 

Expression levels of individual library members were also measured. Briefly, 9 
clones from each library were grown in 2xYT medium containing Amp/Tet/0.5% 
glucose at 37°C overnight. Next day, the cultures were diluted into fresh medium 
with Amp/Tet. At an OD^^ of 0.4, the cultures were induced with 1 mM of IPTG and 
shaken at RT overnight. Then the cell pellets were suspended in 1 ml of PBS buffer 
+ 1 mM of EDTA. The suspensions were sonicated and the supernatants were 
separated on an SDS-PAGE under reducing conditions, blotted on nylon membrane 
and detected with anti-FLAG M1 antibody (see Fig. 11). From the nine clones of the 
10-mer library, all express the scFv fragments. Moreover, the gene III / scFv fusion 
proteins were present in all cases. Among the nine clones from the 15-mer library 
analyzed, 6/9 (67%) led to the expression of both scFv and the gene lll/scFv fusion 
proteins. More importantly, all clones expressing the scFvs and gene ill/scFv fusions 
gave rise to about the same level of expression. 

2.4 Biopanning 

Phages displaying the antibody libraries were prepared using standard protocols. 
Phages derived from the 10-mer library were mixed with phages from the 15-mer 
library in a ratio of 20:1 (1x10 t0 cfu/well of the 10-mer and 5x10 s cfu/well of the 15- 
mer phages, respectively). Subsequently, the phage solution was used for panning 
in ELISA plates (Maxisorp, Nunc) coated with FITC-BSA (Sigma) at concentration of 
100 pg/ml in PBS at 4°C overnight. The antigen-coated wells were blocked with 3% 
powder milk in PBS and the phage solutions in 1% powder milk were added to each 
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well and the plate was shaken at RT for 1 hr. The wells were then washed wit u , 
PBST and PBS (4 times each with shaking at RT for 5 minutes). The bound phages 
were eluted with 0.1 M triethylamine (TEA) at RTfor 10 minutes. The eluted phage 
solutions were immediately neutralized with 1/2 the volume of 1 M Tris-CI, pH 7.6. 
Eluted phage solutions (ca. 450 y\) were used to infect 5 ml of XL1 Blue cells at 
37°C for 30 min. The infected cultures were then plated out on large LB plates 
(Amp/Tet/Glucose) and allowed to grow at 37°C until the colonies were visible. The 
colonies were suspended in 2xYT medium and the glycerol cultures were made as 
above described. This panning round was repeated twice, and in the third round 
elution was carried out with addition of fluorescein in a concentration of 100 yg/ml in 
PBS. The enrichment of specific phage antibodies was monitored by panning the 
initial as well as the subsequent fluorescein-specific sub-libraries against the 
blocking buffer (Fig. 12). Antibodies with specificity against fluorescein were 
isolated after 3 rounds of panning. 

2.5 ELISA measurements 

One of the criteria for the successful biopanning is the isolation of individual phage 
clones that bind to the targeted antigen or hapten. We undertook the isolation of 
anti-FITC phage antibody clones and characterized them first in a phage ELISA 
format. After the 3rd round of biopanning {see above), 24 phagemid containing 
clones were used to inoculate 100 /vl of 2xYT medium (Amp/Tet/Glucose) in an 
ELISA plate (Nunc), which was subsequently shaken at 37°C for 5 hrs. 100 y\ of 
2xYT medium (Amp/Tet/1 mM !PTG) were added and shaking was continued for 30 
minutes. A further 100 y\ of 2xYT medium (Amp/Tet) containing the helper phage 
(1 x 10 9 cfu/well) was added and shaking was done at RTfor 3 hrs. After addition of 
kanamycin to select for successful helper phage infection, the shaking was 
continued overnight. The plates were then centrifuged and the supernatants were 
pipetted directly into ELISA wells coated with 100 y\ FITC-BSA (lOOprg/ml) and 
blocked with milk powder. Washing was performed similarly as during the panning 
procedure and the bound phages were detected with anti-Ml3 antibody- 
POD conjugate (Pharmacia) using soluble POD substrate (Boehringer-Mannheim). 
Of the 24 clones screened against FITC-BSA, 22 were active in the ELISA (Fig. 13). 
The initial libraries of similar titer gave rise to no detectable signal. 
Specificity for fluorescein was measured in a competitive ELISA. Periplasmic 
fractions of five FITC specific scFvs were prepared as described above. Western 
blotting indicated that all clones expressed about the same amount of scFv fragment 
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(data not shown). ELISA was performed as described above, but additionally, ihe 
periplasmic fractions were incubated 30 min at FIT either with buffer (no inhibition), 
with 10 mg/ml BSA (inhibition with BSA) or with 10 mg/ml fluorescein (inhibition 
with fluorescein) before adding to the well. Binding scFv fragment was detected 
using the anti-FLAG antibody M1. The ELISA signal could only be inhibited, when 
soluble fluorescein was added, indicating binding of the scFvs was specific for 
fluorescein (Fig. 14). 

2.6 Sequence analysis 

The heavy chain CDR3 region of 20 clones were sequenced in order to estimate the 
sequence diversity of fluorescein binding antibodies in the library (Fig. 15). In total, 
16 of 20 sequences (80%) were different, showing that the constructed library 
contained a highly diverse repertoire of fluorescein binders. The CDR3s showed no 
particular sequence homology, but contained on average 4 arginine residues. This 
bias towards arginine in fluorescein binding antibodies had already been described 
by Barbas et al M 1992. 

2.7 Production 

E. coli JM83 was transformed with phagemid DNA of 3 selected clones and 
cultured in 0.5 L 2xYT medium. Induction was carried out with 1 mM IPTG at 
ODeoonm = °- 4 and 9 r o wth wa s continued with vigorous shaking at RT overnight. 
The cells were harvested and pellets were suspended in PBS buffer and sonicated. 
The supernatants were separated from the cell debris via centrifugation and purified 
via the BioLogic system (Bio-Rad) by with a POROS®MC 20 column (IMAC, 
PerSeptive Biosystems, Inc.) coupled with an ion-exchange chromatography 
column. The ion-exchange column was one of the POROSES, CM or HQ or PI 20 
(PerSeptive Biosystems, Inc.) depended on the theoretical pi of the scFv being 
purified. The pH of all the buffers was adjusted to one unit lower or higher than the pi 
of the scFv being purified throughout. The sample was loaded onto the first IMAC 
column, washed with 7 column volumes of 20 mM sodium phosphate, 1 M NaCI and 
10 mM imidazole. This washing was followed by 7 column volumes of 20 mM 
sodium phosphate and 10 mM imidazole. Then 3 column volumes of an imidazole 
gradient (10 to 250 mM) were applied and the eluent was connected directly to the 
ion-exchanger. Nine column volumes of isocratic washing with 250 mM imidazole 
was followed by 15 column volumes of 250 mM to 100 mM and 7 column volumes of 
an imidazole / NaCI gradient (100 to 10 mM imidazole, 0 to 1 M NaCI). The flow rate 
was 5 ml/min. The purity of scFv fragments was checked by SDS-PAGE Coomassie 
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staining (Fig. 16). The concentration of the fragments was determined from the 
absorbance at 280 nm using the theoretically determined extinction coefficient (Gill 
& von Hippel, 1989). The scFv fragments could be purified to homogeneity (see 
Fig. 1 6). The yield of purified fragments ranged from 5 to 10 mg/L/OD. 



Example 3; HuCAL H3k2 Library Against a Collection of Antigens 

In order to test the library used in Example 2 further, a new selection procedure was 
carried out using a variety of antigens comprising B-estradiol t testosterone, Lewis-Y 
epitope (LeY), interleukin-2 (IL-2), lymphotoxin-B (LT-I3), E-selectin ligand-1 (ESL-1), 
and BSA. 

3.1 Biopanning 

The library and all procedures were identical to those described in Example 2. The 
ELISA plates were coated with B-estradiol-BSA (100 /;g/ml), testosterone-BSA (100 
pg/ml), LeY-BSA (20/yg/ml) IL-2 (20//g/ml), ESL-1 (20 /ig/ml) and BSA (100 //g/ml), 
LT-B (denatured protein, 20 /7g/ml). In the first two rounds, bound phages were 
eluted with 0.1 M triethylamine (TEA) at RT for 10 minutes. In the case of BSA, 
eiution after three rounds of panning was carried out with addition of BSA in a 
concentration of 100 /7g/ml in PBS. In the case of the other antigens, third round 
eiution was done with 0.1 M triethylamine. in all cases except LeY, enrichment of 
binding phages could be seen (Figure 17). Moreover, a repetition of the biopanning 
experiment using only the 15-mer library resulted in the enrichment of LeY-binding 
phages as well (data not shown). 

3.2. ELISA measurements 

Clones binding to B-estradio!, testosterone, LeY, LT-B, ESL-1 and BSA were further 
analyzed and characterized as described in Example 2 for FITC. ELISA data for anti- 
B-estradiol and anti-ESL-1 antibodies are shown in Fig. 18. In one experiment, 
selectivity and cross-reactivity of binding scFv fragments were tested. For this 
purpose, an ELISA plate was coated with FITC, testosterone, B-estradiol, BSA, and 
ESL-1, with 5 wells for each antigen arranged in 5 rows, and 5 antibodies, one 
against each of the antigens, were screened against each of the antigens. Fig. 19 
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shows the specific binding of the antibodies to the antigen if was selected for, and 
the low cross-reactivity with the other four antigens. 

3.3 Sequence analysis 

The sequencing data of several clones against B-estradiol (34 clones), testosterone 
(12 clones), LT-B (23 clones), ESL-1 (34 clones), and BSA (10 clones) are given in 
Figures 20 to 24. 

Example 4: Vector Construction 

To be able to take advantage of the modularity of the consensus gene repertoire, a 
vector system had to be constructed which could be used in phage display 
screening of HuCAL libraries and subsequent optimization procedures. Therefore, 
all necessary vector elements such as origins of single-stranded or double-stranded 
replication, promotor/operator, repressor or terminator elements, resistance genes, 
potential recombination sites, gene III for display on filamentous phages, signal 
sequences, or detection tags had to be made compatible with the restriction site 
pattern of the modular consensus genes. Figure 25 shows a schematic 
representation of the pCAL vector system and the arrangement of vector modules 
and restriction sites therein. Figure 25a shows a list of all restriction sites which are 
already incorporated into the consensus genes or the vector elements as part of the 
modular system or which are not yet present in the whole system. The latter could be 
used in a later stage for the introduction of or within new modules. 

4. 1 Vector modules 

A series of vector modules, was constructed where the restriction sites flanking the 
gene sub-elements of the HuCAL genes were removed, the vector modules 
themselves being flanked by unique restriction sites. These modules were 
constructed either by gene synthesis or by mutagenesis of templates. Mutagenesis 
was done by add-on PCR, by site-directed mutagenesis (Kunkel et aL t 1991) or 
multisite oiigonucleotide-mediated mutagenesis (Sutherland et al., 1995; Perlak, 
1990) using a PCR-based assembly method. 
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Figure 26 contains a list of the modules constructed. Instead of the terminator 
module M9 (Hindlll-Ipp-Pacl), a larger cassette M9II was prepared to introduce Fsel 
as additional restriction site. M9H can be cloned via Hindlll/BsrGI. 
Ail vector modules were characterized by restriction analysis and sequencing. In the 
case of module Ml 1 -II, sequencing of the module revealed a two-base difference in 
positions 164/65 compared to the sequence database of the template. These two 
different bases (CA GC) created an additional Banll site. Since the same two- 
base difference occurs in the f1 origin of other bacteriophages, it can be assumed 
that the two-base difference was present in the template and not created by 
mutagenesis during cloning. This Banll site was removed by site-directed 
mutagenesis, leading to module M11-III. The BssSI site of module M14 could initially 
not be removed without impact on the function of the ColE1 origin, therefore M14- 
Ext2 was used for cloning of the first pCAL vector series. Figures 29 to 34 are 
showing the functional maps and sequences of the modules used for assembly of 
the modular vector pCAL4 (see below). The functional maps and sequences of 
additional modules can be found in Figure 35a. Figure 35b contains a list of 
oligonucleotides and primers used for the synthesis of the modules. 

4.2 Cloning vector pMCS 

To be able to assemble the individual vector modules, a cloning vector pMCS 
containing a specific multi-cloning site (MCS) was constructed. First, an MCS 
cassette (Fig. 27) was made by gene synthesis. This cassette contains all those 
restriction sites in the order necessary for the sequential introduction of all vector 
modules and can be cloned via the 5'-HindlII site and a four base overhang at the 
3'-end compatible with an Aatll site. The vector pMCS (Figure 28) was constructed 
by digesting pUC19 with Aatll and Hindlll, isolating the 2174 base pair fragment 
containing the bla gene and the CoIE1 origin, and ligating the MCS cassette. 

4.3 Cloning of modular vector pCAL4 

This was cloned step by step by restriction digest of pMCS and subsequent ligation 
of the modules M1 '(via Aatll/Xbal), M7IH (via EcoRI/Hindlll), and M9I1 (via 
Hindlll/BsrGI), and Ml 1-11 (via BsrGI/Nhei). Finally, the bla gene was replaced by the 
cat gene module M17 (via Aatll/Bglll), and the wild type ColEl origin by module 
M14-Ext2 (via Bglll/Nhel). Figure 35 is showing the functional map and the 
sequence of pCAL4. 
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4.4 Cloning of low-copy number plasmid vectors pCALO 

A series of low-copy number plasmid vectors was constructed in a similar way using 
the p15A module M12 instead of the ColE1 module M14-Ext2. Figure 35a is 
showing the functional maps and sequences of the vectors pCALOl to pCAL03. 

Example 5: Construction of a HuCAL scFv Library 
5.1. Cloning of all 49 HuCAL scFv fragments 

All 49 combinations of the 7 HuCAL-VH and 7 HuCAL- VL consensus genes were 
assembled as described for the HuCAL VH3-Vk2 scFv in Example 2 and inserted 
into the vector pBS12, a modified version of the pLisc series of antibody expression 
vectors (Skerra et a/., 1991). 

5.2 Construction of a CDR cloning cassette 

For replacement of CDRs ( a universal B-lactamase cloning cassette was constructed 
having a multi-cloning site at the 5'-end as well as at the 3'-end. The S'-muIti-cloning 
site comprises all restriction sites adjacent to the 5'-end of the HuCAL VH and VL 
CDRs, the 3'-multi-cloning site comprises all restriction sites adjacent to the 3' end of 
the HuCAL VH and VL CDRs. Both 5'- and 3'-multi-cloning site were prepared as 
cassettes via add-on PCR using synthetic oligonucleotides as 5'- and 3'-primers 
using wild type (3-lactamase gene as template. Figure 36 shows the functional map 
and the sequence of the cassette bla-MCS. 

5.3. Preparation of VL-CDR3 library cassettes 

The VL-CDR3 libraries comprising 7 random positions were generated from the 
PCR fragments using oligonucleotide templates Vk1&Vk3, Vk2 and Vk4 and 
primers 0_K3L_5 and 0_K3L_3 (Fig. 37) for the Vk genes, and V?, and primers 
0_L3L_5 (5'-GCAGAAGGCGAACGTCC-3') and 0_L3LA_3 (Fig. 38) for the Va 
genes. Construction of the cassettes was performed as described in Example 2.3. 
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5.4 Cloning of HuCAL scFv genes with VL-CDR3 libraries 

Each of the 49 single-chains was subcloned into pCAL4 via Xbal/EcoRI and the VL- 
COR3 replaced by the B-lactamase cloning cassette via Bbsl/Mscl, which was then 
replaced by the corresponding VL-CDR3 library cassette synthesized as described 
above. This CDR replacement is described in detail in Example 2.3 where the cat 
gene was used. 

5.5 Preparation of VH-CDR3 library cassette 

The VH-CDR3 libraries were designed and synthesized as described in Example 
2.3. 

5.6 Cloning of HuCAL scFv genes with VL~ and VH-CDR3 libraries 

Each of the 49 single-chain VL-CDR3 libraries was digested with BssHII/Styl to 
replace VH-CDR3. The "dummy" cassette digested with BssHII/Styl was inserted, 
and was then replaced by a corresponding VH-CDR3 library cassette synthesized 
as described above. 

Example 6: Expression tests 

Expression and toxicity studies were performed using the scFv format VH-linker-VL. 
All 49 combinations of the 7 HuCAL-VH and 7 HuCAL-VL consensus genes 
assembled as described in Example 5 were inserted into the vector pBS13, a 
modified version of the pLisc series of antibody expression vectors (Skerra et a/ M 
1 991 ). A map of this vector is shown in Fig. 39. 

£ coli JM83 was transformed 49 times with each of the vectors and stored as 
glycerol stock. Between 4 and 6 clones were tested simultaneously, always 
including the clone H3k2, which was used as internal control throughout. As 
additional control, the McPC603 scFv fragment (Knappik & Pluckthun, 1995) in 
PBS13 was expressed under identical conditions. Two days before the expression 
test was performed, the clones were cultivated on LB plates containing 30 //g/ml 
chloramphenicol and 60 mM glucose. Using this plates an 3 ml culture (LB medium 
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containing 90 yg chloramphenicol and 60 mM glucose) was inoculated overnight 
at 37 °C. Next day the overnight culture was used to inoculate 30 ml LB medium 
containing chloramphenicol (30 /vg/ml). The starting OD^^ was adjusted to 0.2 and 
a growth temperature of 30 °C was used. The physiology of the cells was monitored 
by measuring every 30 minutes for 8 to 9 hours the optical density at 600 nm. After 
the culture reached an OD^^ of 0.5, antibody expression was induced by adding 
IPTG to a final concentration of 1 mM. A 5 ml aliquot of the culture was removed 
after 2 h of induction in order to analyze the antibody expression. The cells were 
lysed and the soluble and insoluble fractions of the crude extract were separated as 
described in Knappik & Pluckthun, 1995. The fractions were assayed by reducing 
SDS-PAGE with the samples normalized to identical optical densities. After blotting 
and immunostaining using the a-FLAG antibody M1 as the first antibody (see Ge et 
a/., 1994) and an Fc-specific anti-mouse antiserum conjugated to alkaline 
phosphatase as the second antibody, the lanes were scanned and the intensities of 
the bands of the expected size (appr. 30 kDa) were quantified densitometrically and 
tabulated relative to the control antibody (see Fig. 40). 



Example 7: Optimization of Fluorescein Binders 

7.1. Construction of L-CDR3 and H-CDR2 library cassettes 

A L-CDR3 library cassette was prepared from the oligonucleotide template CDR3L 
(S'-TGGAAGCTGAAGACGTGGGCGTGTATTATTGCCAGCAGfTRSjrTRI^CCG^RI)- 
TTTGGCCAGGGTACGAAAGTT-3') and primer 5 , -AACTTTCGTACCCTGGCC-3 > for 
synthesis of the complementary strand, where (TRI) was a trinucleotide mixture 
representing all amino acids except Cys, (TR5) comprised a trinucleotide mixture 
representing the 5 codons for Ala, Arg, His, Ser, and Tyr. 

A H-CDR2 library cassette was prepared from the oligonucleotide template CDRsH 
(5-AGGGTCTCGAGTGGGTGAGC(TRI)ATT(TRI) 2 . 3 (6) 2 (TRI)ACC(TRI)TATGCGGATA- 
GCGTGAAAGGCCGTTTTACCATTTCACGTGATAATTCGAAAAACACCA-3 T ), and 
primer S'-TGGTG I I It I CGAATTATCA-3' for synthesis of the complementary strand, 
where (TRI) was a trinucleotide mixture representing all amino acids except Cys, (6) 
comprised the incorporation of (A/G) (A/C/G) T, resulting in the formation of 6 codons 
for Ala, Asn, Asp, Gly, Ser, and Thr, and the length distribution being obtained by 
performing one substoichiometric coupling of the (TRI) mixture during synthesis, 
omitting the capping step normally used in DNA synthesis. 
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DNA synthesis was performed on a 40 nmole scale, o!igos were oissotved *n TF 
buffer, purified via gel filtration using spin columns (S-200), and the DNA 
" concentration determined by OD measurement at 260 nm (OD 1.0 = 40 £/g/mI). 
10 nmole of the oligonucleotide templates and 12 nmole of the corresponding 
primers were mixed and annealed at 80°C for 1 min, and slowly cooled down to 
37°C within 20 to 30 min. The fill-in reaction was performed for 2 h at 37°C using 
Klenow polymerase (2.0 //!) and 250 nmole of each dNTP. The excess of dNTPs 
was removed by gel filtration using Nick-Spin columns (Pharmacia), and the double- 
stranded DNA digested with Bbsl/Mscl (L-CDR3), or Xhol/Sful (H-CDR2) over night 
at 37°C. The cassettes were purified via Nick-Spin columns (Pharmacia), the 
concentration determined by OD measurement, and the cassettes aliquoted (15 
pmole) for being stored at -80°C. 

7.2 Library cloning: 

DNA was prepared from the collection of FITC binding clones obtained in Example 2 
(approx. 10 4 to clones). The collection of scFv fragments was isolated via Xbal/EcoRI 
digest. The vector pCAL4 (100 fmole, 10 /vg) described in Example 4.3 was similarly 
digested with Xbal/EcoRI, gel-purified and ligated with 300 fmole of the scFv 
fragment collection over night at 16°C. The ligation mixture was isopropanol 
precipitated, air-dried, and the pellets were redissolved in 100 fj\ of dd H 2 0. The 
ligation mixture was mixed with 1 ml of freshly prepared electrocompetent SCS 101 
cells (for optimization of L-CDR3), or XL1 Blue cells (for optimization of H-CDR2) on 
ice. One round of electroporation was performed and the transformants were eluted 
in SOC medium, shaken at 37°C for 30 minutes, and an aliquot plated out on LB 
plates (AmpATet/Glucose) at 37°C for 6-9 hrs. The number of transformants was 5 x 
10 4 . 

Vector DNA (100 (jg) was isolated and digested (sequence and restriction map of 
scH3k2 see Figure 8) with. Bbsl/Mscl for optimization of L-CDR3, or Xhol/NspV for 
optimization of H-CDR2. 10 /vg of purified vector fragments (5 pmole) were ligated 
with 15 pmole of the L-CDR3 or H-CDR2 library cassettes over night at 16°C. The 
ligation mixtures were isopropanol precipitated, air-dried, and the pellets were 
redissolved in 100 /vl of dd H 2 0. The ligation mixtures were mixed with 1 ml of freshly 
prepared electrocompetent XL1 Blue cells on ice. Electroporation was performed 
and the transformants were eluted in SOC medium and shaken at 37°C for 30 
minutes. An aliquot was plated out on LB plates (Amp/Tet/Glucose) at 37°C for 6-9 
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hrs. The number of transformants (library size) was greater than i0 8 tor both 
libraries. The libraries were stored as glycerol cultures. 

7.3. Biopanning 

This was performed as described for the initial H3k2 H-CDR3 library in Example 2.1. 
Optimized scFvs binding to FITC could be characterized and analyzed as described 
in Example 2.2 and 2.3, and further rounds of optimization could be made if 
necessary. 
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Table 1 A: Human kappa germiine gene 



Used Name' Reference 2 Family 3 



Vkl-1 


9 


1 


.Vkl-2 


1 


1 


Vk1-3 


2 


1 


Vkl-4 


9 


1 


Vkl-5 


2 


! 


Vk1-6 


1 


1 


Vk1-7 


1 


! 


Vk1-8 


1 


1 


Vkl-9 


1 


1 


Vkl-10 


1 


, 


Vkl-11 


1 




Vkl-12 


2 


1 


Vkl-13 


2 


1 


Vkl-14 


8 


1 


Vkl-15 


8 


1 


vki-ie 


1 


! 


Vkl-17 


2 


1 


Vkl-18 


1 


, 


Vk1-19 


6 


1 


Vkl-20 


2 


! 


Vkl-21 


1 


1 


Vkl-22 


2 


1 


Vkl-23 


2 


, 


Vk2-1 


1 


2 


Vk2-2 


6 


2 


Vk2-3 


6 


2 


Vk2-4 


2 


2 


Vk2-5 


1 


2 


Vk2-G 


4 


2 


Vk2-7 


4 


2 


Vk2-8 


4 


2 


Vk2-9 


1 


2 



Germiine genes* 

08.018; DPK1 
L14;DPK2 

L15(1); HK101; HK146; HK189 

LI 1 

A30 

LFVK5 

LFVK431 

Ll; HK137 

A20; DPK4 

118; Va" 

L4; L1B;Va';V4a 

L5; LI 9(1); Vb; Vb4; DPK5; Li 9(2); Vb"; DPK6 
L15(2);HK134; HK166; DPK7 
L8;Vd; DPK8 
L9;Ve 

L12(1);HK102;V1 

LI 2(2) 

012a (V3b) 

02;012;DPK9 

L24;Ve";Vl3;DPK10 

04; 014 

L22 

L23 

A2; DPK12 

01:011(1); DPK13 

Ol2(2);V3a 

L13 

DPK14 

A3;A19;DPK15 
A29; DPK27 
A13 
A23 
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Used Name 1 Reference 3 Family 3 Germline genes 4 



Vk2-10 


4 


2 




Vk2-11 


4 


2 


a 17. nPKift 


Vk2-12 


4 


2 


A 1 • nPK 1Q 
A I , Urfv 1 3 


Vk3-1 


11 


3 


MM, nUIIIKVJUj, UflvtU 


Vk3-2 


1 


3 


1 on- \/n" 

LzU, vg 


Vk3-3 


2 


3 


n- 1 ic- humlrvloR- hnmkv?7Rh?- humkv328h5' DPK21 


Vk3-4 


11 


' 3 


A.T7- hnmlruT?^- VkRF- DPK22 


Vk3-5 


2 


3 


L25; UrKzJ 


Vk3-6 


2 


3 


L10(l) 


Vk3-7 


7 


3 


L10(2) 


Vk3-8 


7 


3 


L6;Vg 


Vk4-1 


3 


4 


B3;VklV;DPK24 


Vk5-1 


10 


5 


B2;EV15 


Vk6-1 


12 


6 


A14; DPK25 


Vk6-2 


12 


6 


A10;A26; DPK26 


Vk7-1 


5 


7 


Bl 



4 ^ 
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Table IB: Human lambda germline gene segments 
Used Name' Reference' Family 5 Germline genes 4 



DPL1 ! 1 

DPL2 t 1 HUMLV1L1 

DPL3 t 1 HUMLV122 

DPL4 t 1 VLAMBDA 1.1 

HUMLV117 2 1 

DPL5 i 1 HUMLV117D 

DPLG ! 1 

DPL7 t 1 IGLV1S2 

DPL8 t 1 HUMLV1042 

DPL9 t 1 HUMLV101 

DPL10 t 2 

VLAMBDA 2.1 3 2 

DPL11 i 2 

DPL12 i 2 

DPL13 t 2 

DPL14 i 2 

DPL16 t 3 Humlv418;IGLV3Sl 

DPL23 i 3 VI III. 1 

Humlv318 4 3 

DPL18 ! 7 4A; HUMIGLVA 

DPL19 t 7 

DPL21 t 8 VL8.1 

HUMLV801 5 8 

DPL22 t 9 

DPL24 t unassigned VLAMBDA N.2 

gVLX-4.4 6 10 
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Table 1C: Human heavy chain germline gene segments 



Used Name 1 Reference 2 Family 1 Germline genes* 



vn 1 - i z- i 


1Q 




DP10- DA-2- DA-G 


\/ui 19 ft 

vn l - I z-o 


77 




Ml \- V 1 1 1 


\/m 19 9 
Vn l ~ 1 Z-z 


c 

D 




hv12G3 


\/Hl 17-Q 

vn i i z j 


7 
/ 




YAC-7* RRVHl V 1-69 


VH1-1 2-1 


19 


1 


DP3 


VH 1-1 7-4 
vn i" it • 


19 


1 


DP21; 4d275a; VH7a 


VH1-12-5 


18 


1 


l-4.1b; Vl-4.1b 


VH1-12-6 


21 


1 


lD37;VH7b ; 7-81; YAC-10 


VH 1-1 2-7 
vn i*u / 


19 


] 


DP14; VHlGRR; Vl-18 


\/Hl-1 9-1 

vn i i j- i 


10 


1 


71-5; DP2 


VH1 -11-7 


10 


1 


E3-10 


VH 1-1 9-9 


19 


1 


DP1 


\/m 

vn I - I J *+ 


l? 


1 


V35 


\/m 19 ^ 
vn i - 1 j-d 


Q 
O 




V1-2h 


\/Ui 19 C 


1R 
1 o 




1-2; DP75 


\fU 1 19 7 

vn i - 1 <j / 


71 




V1-2 

V 1 c 


\/U 1 1 9 Q 

vn I - 1 o-o 


1Q 




DPft 


\/Ui 1 9 Q 

Vnl -1 o-y 


9 




1-1 

1 1 


\/u 1 n in 
Vnl-1 J- 1U 


1 Q 




dpi 7 


\/U 1 1 9 11 

Vnl - 1 J- 1 1 


i e; 




V 1 ov. 


\/Ul 19 19 

Vn 1- I J- I Z 


i ft 

1 o 




Mhr DP7 1 ;- VI -3b 


\/ui 19.19 
Vn 1 - 1 J- 1 j 


9 
O 




1-92 


\/m 19. id 
vn i - 1 j i f 


1ft 


1 


1-9- vi-3 


vn 1 — 1 o i j 


19 


1 


DP15; Vl-8 


VH1-11-1R 


3 


1 


21-2; 3-1; DP7; V1-46 


\/Ul 19 17 

Vn 1 - 1 j- 1 / 


1 K 




n vj J 


VH1-13-18 


19 




DP4; 7-2;Vl-45 


VH1-13-19 


27 




COS 5 


VH1-1X-1 


19 




DP5; 1-24P 


VH2-21-1 


18 


2 


ll-5b 


VH2-31-1 


2 


2 


VH2S12-1 


VH2-31-2 


2 


2 


VH2S12-7 


VH2-31-3 


2 


2 


VH2S12-9; DP27 


VH2-31-4 


2 


2 


VH2S12-10 


VH2-31-5 


14 


2 


V2-26; DP2G; 2-26 


VH2-31-6 


15 


2 


VF2-26 
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Table 1C: (continued) 



It 1 1 - - _ 1 

Used Name' 


Reference 2 


Family 3 


Germline genes 4 


Vnz-J I-/ 


1 Q 


I 


r\DOO« HA "7 


\ tl i Oil 1 A 

VH2-31-14 


•7 

7 


2 


YAC-3; 2-70 


VH2-31-8 


2 


2 


VH2S12-5 


VH2-31-9 


2 


2 


VH2S12-12 


VH2-31-10 


18 


2 


11-5; V2-5 


VH2-31-11 


2 


2 


VH2S12-2; VH2S12-8 


VH2-31-12 


2 


2 


VH2S12-4; VH2S12-6 


VH2-31-13 


2 


2 


VH2S12-14 


VH3-1 1-1 


13 


3 


v65-2; DP44 


VH3-1 1-2 


19 


3 


DP45 


VH3-1 1-3 


3 


3 


13-2; DP48 


VH3-1 1-4 


19 


3 


DP52 


VH3-11-5 


14 


3 


v3-13 


VH3-11-G 


19 


3 


DP42 


VH3-11-7 


3 


3 


8-1 B;YAC-5; 3-66 


VH3-11-8 


14 


3 


V3-53 


VH3-13-1 


3 


3 


22-2B;DP35;V3-11 


VH3-13-5 


19 


3 


DP59;VHl9;V3-35 


VH3-13-6 


25 


3 


n-pl; DP61 


VH3-13-7 


19 


3 


DP46; GL-SJ2; COS 8; hv3005 


VH3-13-8 


24 


3 


VH26 


VH3-13-9 


5 


3 


vh26c 


\ /LI *"> 1 O * O 

VH3-13-10 


19 


3 


DP47; VH26; 3-23 


VH3-13-1 1 


3 


3 


1-91 


VH3-13-12 


19 


3 


DP58 


\ /LI 1 t 1 1 o 

VH3-13-13 


3 


3 


1 —SHI; DP49; 3-30; 3d28.1 


VH3-13-14 


24 


3 


3019B9; DP50; 3-33; 3d277 


VH3-13-15 


27 


3 


COS 3 


VH3-13-16 


19 


3 


DP51 


VH3-13-17 


16 


3 


Hi 1 


VH3-13-18 


19 


3 


DP53;C0S6;3-74;DA-8 


VH3-13-19 


19 


3 


DP54;VH3-11;V3-7 


VH3-13-20 


14 


3 


V3-64; YAC-6 


VH3- 13-21 


14 


3 


V3-48 


VH3- 13-22 


14 


3 


V3-43; DP33 


VH3-13-23 


14 


3 


V3-33 
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Table 1C: (continued) 



Used Name' 


Reference 2 


Family' 


Germiine genes* 


VH3- 13-24 


14 


3 


V3-21; UP77 


VH3-13-25 


14 


3 


V3-20; DP32 


VH3-13-26 


14 


3 


V3-9; DP31 


VH3-14-1 


3 


3 


12-2; DP29; 3-72; DA- 3 


VH3-14-4 


7 


. 3 


YAC-9; 3-73; MTGL 


VH3-14-2 


4 


3 


VHD26 


VH3-14-3 


19 


3 


DP30 


VH3-1X-1 


1 


3 


LSG8.1; LSG9.1; LSG10.1; HUM12IGVH; HUM13IGVH 


VH3-1X-2 


1 


3 


LSG11.1;HUM4IGVH 


VH3-1X-3 


3 


3 


9-1; DP38; LSG7.1; RCG1.1; LSGl.1; LSG3.1; LSG5.1; 








nUMlbluVn; HUMzluVn, nUMyiuVH 


VH3-1X-4 


1 


3 


I CP A 1 


VH3-1X-5 


1 


3 


LMj2.1 


VH3-1X-6 


1 


** 
3 


l_>ob.1; HUMlOIbVn 


VH3-1X-7 


18 


*> 

3 


3-15, V3-15 


VH3-1X-8 


1 


3 


Lbbl2.l; HUMSIbVH 


VH3-1X-9 


1 A 

14 


i 
J 


\/i /id 


VH4-1 1-1 


22 


4 


Ton MIX A n 1 


VH4-1 1-2 


17 


A 

4 


\/U/l *>1 • riDCO* V/UC* Ar\l&' \/A 

Vn4.zl, UrbJ, Vnb, 40/b, vh-jh 


VH4-11-3 


23 


4 


4.44 


VH4-1 1-4 


23 


4 


4.44.3 


VH4-11-5 


23 


4 


4.36 


VH4-11-6 


23 


4 


4.37 


VH4-11-7 


18 


4 


l\/ A . A OH . \ / A A 

IV-4; 4.35; V4-4 


VH4-1 1-8 


1 7 


A 

4 


Vn4.1 1 , Juiy/0, Ur/ I f DopZ 


VH4-1 1-9 


20 


a 

4 


II -7 


VH4-1 1-10 


20 


4 


uo 
no 


VH4-11-11 


20 


A 

4 


ny 


VH4-11-12 


17 


4 


VH4.16 


VH4-11-13 


23 


4 


4.38 


VH4-11-14 


17 


4 


VH4.15 


VH4-11-15 


1 1 


4 


58 


VH4-11-16 


10 


4 


71-4;V4-59 


VH4-21-1 


11 


4 


11 


VH4-21-2 


17 


4 


VH4.17; VH4.23; 4d255; 4.40; DP69 


VH4-21-3 


17 


4 


VH4.19;79; V4-4b 
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Table 1C: (continued) 



UScQ INdMlC 




Familv 3 

i ai 1 1 1 1 y 


Germline acnes 4 


VH4-21-4 


19 


4 


DP70; 4d68;4.41 


VH4-21-5 


19 


4 


DP67;VH4-4B 


VH4-21-6 


17 


4 


VH4.22;VHSP; VH-JA 


VH4-21-7 


17 


4 


VH4.13; 1-911; 12G-1; 3d28d; 4.42; DP68;4-28 


VH4-21-8 


26 


4 


hv4005; 3d24d 


ViTt-i 1 J 


17 


4 


VH4.14 


\fUA 7 1 _ 1 
VF14-J i 1 




4 


4 34* 3d230d' DP78 


\/LM 7 1 7 

Vn4-J I -z 


77 
zo 




414 7 


\/UA 7 1 7 


1 Q 

I j 


4 




\ /Li y| ")1 A 

VH4-J 1 -4 


1 Q 

i y 


H 


HPK^* 4-71 * 7H777d 


\/U A 7 1 £ 


77 
ZO 




4 77* 7H7RH 






4 


H10 


V/HA 71-7 
Vrl4- J * / 




4 


H 1 1 


\/HA 71 -ft 


77 


4 


4.31 


\/MA 71-Q 


71 

z. j 


4 


4.32 


\/ha-7 1 - i n 


70 


4 


3d277d 


\/WA 71 11 
Vr14- J f - II 


70 


4 


ld716d 


\/UA 71 17 
Vn*t"j 1 - 1 Z 


70 


A 
*r 


ld77Qd 

Jul / JU 


VH4-31-13 


17 


4 


VH4.18; 4d 1 54, Ur/y 


VH4-31-14 


8 


4 


V4-39 


VH4-31-15 


11 


4 


2-1;DP79 


VH4-31-1G 


23 


4 


4.30' 


VH4-31-17 


17 


4 


VH4.12 


VH4-31-18 


10 


4 


71-2; DP66 


VH4-31-19 


23 


4 


4.39 


VH4-31-20 


8 


4 


V4-61 


VH5-12-1 


9 


5 


VH251 ; DP73; VHVCW; 51-Rl ; VHVLB; VHVCH; VHVTT; 








VHVAU; VHVBLK; VhAU; V5-51 


VH5-12-2 


17 


5 


VHVJB 


VH5-12-3 


3 


5 


1-v; DP80; 5-78 


VH5-12-4 


9 


5 


VH32;VHVR6; VHVMW; 5-2R1 


VH6-35-1 


4 


6 


VHVI; VHG; VHVIIS; VHVITE; VHVIJB; VHVICH; VHViCW; 



VHVIBLK; VHVIMW; DP74; 6-lGl;V6-1 
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aa 2 Computed 
family 3 


Germline 
gene 4 


Diff. to 
germiine s 


% diff. to 
germline 6 


Reference 7 


III-3R 


108 1 


An 

08 


1 


1 "\0fn 

\, \y/o 


70 


No.86 


109 1 


08 


J 


7 7Qfr» 


80 


AU 


108 1 


08 


6 




103 


ROY 


108 1 


no 
Uo 


c 

D 




43 


IC4 


108 1 


Uo 


C 
O 




70 


HIV-B2G 


10G 1 


08 


o 
J 


7 70Jn 
J,ZtO 


8 


GRI 


108 1 


U8 


Q 
O 


ft 40/n 


30 


AG 


106 1 


08 


o 
o 




116 


REI 


108 1 


08 


9 




OO 


CLL PATIENT 16 


88 1 


08 


2 


Z,J u /0 


177 
1 zz 


CLL PATIENT 14 


87 1 


08 


z 


7 TO/ft 
Z f J u /0 


177 

1 CC 


CLL PATIENT 15 


88 1 


U8 


Z 


- 7 70/n 


122 


GM4672 


108 1 


U8 


1 n 
1 I 


1 1 ( D /U 


24 


HUM. YFC51.1 


108 1 


08 


1 2 


1 Z,b u /0 


no 


LAY 


108 1 


08 


1 1 


17 K0/n 


48 


HIV-013 


106 1 


no 
08 


Q 


Q 70/n 




MAL-NaCI 


108 1 


08 


13 


1 *> 7Q/« 
1 J f /Hf0 


107 
IUZ 


STRAb SA-1A 


108 1 


02 


0 


U f u u /0 


1 70 


HuVHCAMP 


108 1 


08 


13 


1 o f / u /o 


inn 


CRO 


108 1 


02 


10 






Am 107 


108 1 


02 


12 


1 1 ecu*. 


IUO 


WALKER 


107 1 


02 


4 


4,Z u /0 


D/ 


III-2R 


109 1 


A20 


0 






F0G1-A4 


107 1 


A20 


4 


a on/A 


41 


HK137 


95 1 


Li 


0 


n no/ft. 


in 


CEA4-8A 


107 1 


02 


7 




A1 


Va" 


95 1 


L4 


0 


u,u u /o 


QO 


TR1.21 


lUo I 




4 


4,2% 


92 


HAU 


108 1 


H7 


c 

D 


6 f 3% 


123 


HK102 


95 1 


L12(1) 


0 


0.0% 


9 


H20C3K 


108 1 


L12(2) 


3 


3 f 20/o 


125 


CHEB 


108 1 


02 


7 


7,4% 


5 


HK134 


95 1 


LI 5(2) 


0 


0,0% 


10 


TEL9 


108 1 


02 


9 


9,5% 


73 






53 
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Name' 


aa 2 Computed 
family 1 


Germline 
gene 4 


Diff. to 
germline 5 


o/o diff, to 
germline 6 


Reference' 


TR1.32 


i m i 


uz 


o 




92 


RF-KES1 


97 1 


MZU 


A 




121 


WES 


108 I 




in 
1 u 




61 


DILpI 


95 1 




i 

i 


1 , 1 /u 


70 


SA-4B 


107 1 


1 1 of ot 

L12I2J 


o 
o 


ft AO/ft 


1 JLVJ 


HK101 


95 1 


i i r f 1 \ 

L15(1) 


0 


U,U u /0 


Q 


TR1.23 


108 1 


U2 


r 
D 


c on/« 


Q7 

yz 


HF2-1/17 


108 1 


A30 


0 


0,0% 


4 


2E7 


108 1 


A30 


1 


1,1% 


bz 


33.C9 


107 1 


L12(2) 


7 


7 t 4% 


1 ZD 


3D6 


105 1 


L12(2) 


2 


o i rv. 

2,1 D /o 




I-23 


108 1 


L8 


8 


8,4% 


/U 


RF-KL1 


97 1 


L8 


4 


a on/. 

4,2% 


1 zl 


TNF-E7 


108 1 


A30 


9 


9,5% 


A 1 

41 


TR1.22 


108 1 


02 


7 


7,4% 


no 
92 


HIV-B35 


106 1 


02 


2 


o on/. 

2,2% 


0 
0 


HIV-D22 


106 1 


02 


2 


2,2% 


8 


HIV-b27 


106 1 


02 


2 


2,2% 


8 


HIV-B8 


107 1 


02 


10 


1 0,8% 


8 


HIV-b8 


107 1 


02 


10 


10,8% 


8 


RF-SJ5 


95 1 


A30 


5 


5,3% 


1 1 o 

1 13 


GAL(I) 


108 1 


A30 


6 


in/ 

6,3% 


C A 

64 


R3.5H5G 


108 1 


02 


6 


6,3% 


70 


HIV-b14 


106 1 


A20 


2 


O Oft/. 

2,2% 


0 
0 


TNF-E1 


105 1 


15 


8 


o t 4% 


41 


WEA 


108 1 


A30 


o 

8 


o,4 u A) 


"5*7 
J/ 


EU 


108 1 


1 1 o ( o\ 

L12(2) 


5 




A C\ 

4U 


FOG1-G8 


108 1 


L8 


1 1 


1 1,6% 


41 


1X7RG1 


108 1 


LI 


8 


8,40/0 


70 


BLI 


108 1 


L8 


3 


3 ( 20to 


72 


KUE 


108 1 


L12(2] 


11 


11,6% 


32 


LUNmOl 


108 1 


LI 2(2) 


10 


10.5% 


6 


HiV-bl 


106 1 


A20 


4 


4.30A) 


8 


HIV-S4 


103 1 


02 


2 


2 ( 20/o 


8 
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Table 2A: (continued) 



Nlamp 1 


aa 2 


Computed 
family 3 


Germline 
gene 4 


Diff. to 
germline 5 


% diff. to 
germline 6 


Reference 


CAR 


107 


1 


l i if *l\ 

LI 2(2) 


1 1 


1 1 IQJn 


79 


BR 


107 


1 




1 1 
1 1 


1 1 , U fU 


50 


CLL PATIENT 10 


no 

Bo 


1 


n? 
uz 


n 


0,0% 


122 


CLL PATIENT 12 


oo 
oo 






o 


0,0% 


122 


KING 


luo 




1 12(21 


12 


12,6% 


30 


V13 


3D 




1 94 

LZH 


o 


0,0% 


46 


CLL PATIENT 1 1 


87 




uz 


n 


0,0% 


122 


CLL PATIENT 13 


87 


1 


Uz 


n 
u 




122 


CLL PATIENT 9 


88 


1 


012 


l 


1 , 1 u /0 


197 

1 zz 


HIV-B2 


106 


1 


A20 


9 


9,7% 


Q 
0 


HIV-b2 


106 


1 


A20 


9 


9,7% 


o 
o 


CLL PATIENT 5 


88 


1 


A20 


1 


1 ,1% 


1 ZZ 


CLL PATIENT 1 


88 


1 


L8 


2 


1 Tfl/U 


1 9? 
1 ZZ 


CLL PATIENT 2 


88 


1 


L8 


0 


n aoju 
U.UWO 


1 99 
1 ZZ 


CLL PATIENT 7 


88 


1 


L5 


0 




199 
1 ZZ 


CLL PATIENT 8 


88 


1 


L5 


0 


A AftJU 


199 
1 ZZ 


HIV-b5 


105 


1 


L5 


1 1 


12,0% 


Q 

O 


CLL PATIENT 3 


87 


1 


L8 


1 


1,1% 


122 


CLL PATIENT 4 


88 


1 


L9 


0 


0,0% 


122 


CLL PATIENT 18 


85 


1 


L9 


6 


7,1% 


< *i i 
122 


CLL PATIENT 17 


86 


1 


LI 2(2) 


7 


8,1% 


1 zz 


HIV-020 


107 




A27 


1 1 


1 1 ,7% 


o 
o 


2C12 


108 


1 


L12(2) 


20 


. 21, 1% 


CO 

bo 


1B11 


108 


1 


LI 2(2) 


20 


21,1% 


CO 

bo 


1H1 


108 


1 


Ll2{2) 


21 


22,1% 


CO 

bo 


2A12 


108 


1 


Li 2(2} 


21 


22, 1 % 


CO 

bo 


CUR 


109 


3 


A27 


0 


A Afi/«. 


bb 


GLO 


109 


3 


A27 


0 


0,0% 


16 


RF-TSl 


96 


3 


A27 


0 


O.OO/o 


121 


GAR' 


109 


3 


A27 


0 


0,0% 


67 


FLO 


109 


3 


A27 


0 


0,0% ' 


66 


PIE 


109 


3 


A27 


0 


0.0% 


91 


HAH 14.1 


109 


3 


A27 


1 


1,0% 


51 


HAH 14.2 


109 


3 


A27 


1 


1,0% 


51 
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Name' 


aa' 


Computed 


Germline 


Diff. to 


o/o diff. to 


Refererv 






family 3 


gene 4 


germline 5 


germline 6 




HAH 16.1 


109 


3 


A27 


1 


1 ,u°/o 


CI 
3 1 


NOV 


109 


3 


A27 


1 


1 ,0°/o 


CO 


33.F1 2 


108 


3 


A27 


1 


1 ,0% 


l ZD 


8E10 


110 


3 


A27 


1 


1 ,0% 


25 


TH3 


109 


3 


A27 


1 


1 ,0% 


25 


HIC (R) 


108 


3 


A27 




0,0% 


51 


SON 


110 


3 


A27 


1 


1 ,0% 


67 


PAY 


109 


3 


A27 


1 


1 ,0% 


66 


GOT 


109 


3 


A27 


1 


1 ,0% 


67 


mAbA6H4C5 


109 


3 


A27 


1 


1 ,0% 


12 


BOR' 


109 


3 


A27 


2 


2,1% 


84 


RF-SJ3 


96 


3 


A27 


2 


.... 2,1% 


121 


SIE 


109 


3 


A27 


2 


2,1% 


15 


ESC 


109 


3 


A27 


2 


2,1% 


98 


HEW 


110 


3 


A27 


2 


2,1% 


98 


YES8c 


109 


3 


A27 


3 


3,1% 


33 


Tl 


109 


3 


A27 


3 


3,1% 


114 


mAb113 


109 


3 


A27 


3 


3,1% 


71 


HEW 


107 


3 


A27 


0 


0,0% 


94 


BRO 


106 


3 


A27 


0 


0,0% 


94 


ROB 


106 


3 


A27 


0 


0,0% 


94 


N69 


96 


3 


A27 


4 


4,2% 


11 


NEU 


109 


3 


A27 


4 


4,2% 


66 


WOL 


109 


3 


A27 


4 


4,2% 


2 


35G6 


109 


3 


A27 


4 


4,2% 


59 


RF-SJ4 


109 


3 


A1 1 


0 


0,0% 


88 


KAS 


109 


3 


A27 


4 


4,2% 


84 


BRA 


106 


3 


A27 


1 


1,1% 


94 


HAH 


106 


3 


A27 


1 


1,1% 


94 


HIC 


105 


3 


A27 


0 


0,0% 


94 


FS-2 


109 


3 


A27 


6 


6,3% 


87 


JH' 


107 


3 


A27 


6 


6,3% 


38 


EV1-15 


109 


3 


A27 


6 


6,3% 


83 


SCA 


108 


3 


A27 

5 <r 


6 


6 ( 30/o 


65 
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aa J 


Computed 
family 3 


Germline 
gene 4 


Diff. to 
germline 5 


% diff. to 
germline 6 


Reference 1 


mAbl!2 


109 


3 


A27 


6 


6,3°/o 


71 


SIC. 


103 


3 


A27 


3 


3,3% 


QA 


SA-4A 


109 


3 


A27 


6 


6,3% 


1 20 


SER 


108 


3 


A27 


6 


6,3% 


98 


GOL' 


109 


3 


A27 


7 


7,3% 


on 

82 


B5G10K 


105 


3 


A27 


9 


9,7% 


1 ZD 


HG2B10K 


110 


3 


A27 


-9 


9,4% 


1 25 


Taykv322 


105 


3 


A27 


5 


5,4°/o 


CO 


CLL PATIENT 24 


89 


3 


Ml 


1 


i in/ 
1,1% 


100 


HIV-b24 


107 


3 


A27 


1 


/ ,4 u /0 


0 


HIV-b6 


107 


3 


A27 


1 


7,4% 


8 


Taykv310 


99 


3 


A27 


1 


1,1% 


52 


KA3D1 


108 


3 


L6 


0 


0,0% 


85 


19.E7 


107 


3 


L6 


0 


0,0% 


126 


rsvGL 


109 


3 


A27 


12 


12,5% 


7 


Taykv320 


98 


3 


A27 


1 


1,2% 


52 


Vh 


96 


3 


LI 0(2) 


0 


0,0% 


89 


LS8 


108 


3 


L6 


1 


1,1% 


109 


LSI 


108 


3 


L6 


1 


1,1% 


109 


LS2S3-3 


107 


3 


L6 


2 


2,1% 


99 


LS2 


108 


3 


L6 


1. 


1,1% 


109 


LS7 


108 


3 


L6 


1 


1,1% 


109 


LS2S3-4d 


107 


3 


16 


2 


2,1% 


99 


LS2S3-43 


107 


3 


L6 


2 


2,1% 


99 


LS4 


108 


3 


L6 


1 


1,1% 


109 


LSG 


108 


3 


L6 


1 


1,1% 


109 


LS2S3-10a 


107 


3 


L6 


2 


2.1% 


99 


LS2S3-8C 


107 


3 


L6 


2 


2,1% 


99 


LS5 


108 


3 


L6 


1 


1,1% 


109 


LS2S3-5 


107 


3 


L6 


3 


3,2% 


99 


LUNm03 


109 


3 


A27 


13 


13,5% 


6 


IARC/BL4 1 


108 


3 


A27 


13 


13,70/o 


55 


Slkv22 


99 


3 


A27 


3 


3,5% 


13 


POP 


108 


3 


L6 


4 


4,2% 


111 



5 * 

SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2A: (continued) 



PCT/EP96/03647 



Name' 


aa 2 


Computed 
family 3 


Germline 
gene 4 


Diff. to 
germline 5 


% diff. to 
germline 6 


Referen 


LS2S3-10b 


107 


3 


L6 


3 


3,2% 


99 


LS2S3-8f 


107 


3 


L6 


3 


3.2% 


99 


LS2S3-12 


107 


3 


L6 


3 


3.2% 


99 


HIV-B30 


107 


3 


A27 


11 


11.7% 


8 


HIV-B20 


107 


3 


A27 


11 


11.7% 


8 


HIV-b3 


108 


3 


A27 


11 


11.7% 


8 


HIV-S6 


104 


3 


A27 


9 


9.9% 


8 


YSE 


107 


3 


L2/L16 


1 


1.1% 


72 


POM 


109 


3 


L2/L16 


9 


9,4% 


53 


Humkv328 


95 


3 


L2/L16 


1 


1.1% 


19 


CLL 


109 


3 


L2/L16 


3 


3,2% 


47 


LES 


96 


3 


L2/L16 


3 


3.20/0 


38 


HIV-S5 


104 


3 


A27 


11 


12,1% 


8 


HIV-S7 


104 


3 


A27 


11 


12.1% 


8 


slkvl 


99 


3 


A27 


7 


8.1% 


13 


Humka31es 


95 


3 


L2/L16 


4 


4.2% 


18 


slkvl 2 


101 


3 


A27 


8 


9.2% 


13 


RF-TS2 


95 


3 


L2/L16 


3 


3.2% 


121 


11-1 


109 


3 


L2/L16 


4 


4.2% 


70 


HIV-S3 


105 


3 


A27 


13 


14,3% 


8 


RF-TMC1 


96 


3 


16 


10 


10,5% 


121 


GER 


109 


3 


L2/L16 


7 


7.4% 


75 


GF4/1.1 


109 


3 


L2/L16 


8 


8.40/o 


36 


mAb114 


109 


3 


L2/L16 


6 


6.30/0 


71 


HIV-loop13 


109 


3 


L2/L16 


7 


7,4% 


8 


bkv16 


86 


3 


L6 


1 


1,2% 


13 


CLL PATIENT 29 


86 


3 


L6 


1 


1,20/o 


122 


slkv9 


98 


3 


L6 


3 


3,5% 


13 


bkv17 


99 


3 


L6 


1 


1.20/0 


13 


slkvl 4 


99 


3 


L6 


1 


I.20/0 


13 


slkvl e 


101 


3 


L6 


2 


2,3% 


13 


bkv33 


101 


3 


L6 


4 


4.70/o 


13 


slkvl 5 


99 


3 


L6 


2 


2.30/o 


13 


bkv6 


100 


3 


L6 


3 


3.50/o 


13 



58 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 
Table 2A: (continued) 



PCT/EP96/03647 



Name 1 aa' Computed Germline Diff. to % diff. to Reference 7 







family 3 


gene 4 


germline s 


germline 




RGB8K 


i no 
lUo 


o 


1 9/1 1 K 
LZ/L I D 


1 z 




125 


AL 700 


1U/ 


o 
J 


| O/l ic 
LZ/L 1 D 


Q 


9,5% 


117 


slkv! 1 


IUU 


o 
O 


LZ /LID 


9 

o 


3,5% 


13 


— 11— . A 

SIKV4 


Q7 


O 




4 


4,8% 


13 


CLL PATitNl Zb 


0/ 


0 
J 


I oil i c 

LZ/L 1 D 


1 


1,1% 


122 


AL Se124 


1 no 
1UJ 


o 

s> 


19/1 1 C 
LZ/L 1 D 


q 




117 


slkv13 


1 nn 
IUU 


O 


LZ/L 1 D 


c 

D 


1 0% 


13 


bkv7 


1 nn 
IUU 


•> 


19/1 1 C 
LZ/L 1 D 


c 




13 


bkv22 


1 nn 
IUU 


J 


LZ/L 1 b 


C 
D 


7 OOJh 


13 


CLL PATIENT 27 


84 


3 


1 O /I 1 c 

Lz/Llb 


n 
u 




122 


bkv35 


100 


3 


Lb 


o 
o 


Q 90Ai 


1 1 


CLL PATIENT 25 


87 


3 


L2/L16 


4 


X CD/a 

4,b u /o 


1 99 
I ZZ 


slkv3 


86 


3 


L2/L16 


/ 


Q 1 DJrt 

O, i u /o 


1 9 


slkv7 


99 


1 


02 


1 


0, 1 U /D 


1 9 
1 o 


HuFd79 


111 


3 


L2/L16 


24 


Z4 t Z u /0 


91 

Z 1 


RAD 


99 


3 


A27 


9 


1 n ooju 


7R 
/o 


CLL PATIENT 28 


83 


3 


L2/L16 


4 


4 t O u /D 


1 99 
1 ZZ 


REE 


104 


3 


L2/L16 


ZD 


97 90A\ 


QH 


FR4 


99 


3 


AIT 

Az7 


Q 

o 


Q 90/n 


77 
/ / 


MD3.3 


92 


3 


Lb 


1 
I 


1 90Ai 


HA 


MD3.1 


92 


3 


1 c 

Lb 


U 




H4 


GA3.6 


92 


3 


Lb 


o 
Z 




HA 


M3.5N 


92 


3 


Lb 


o 


9 flOJh 


H4 


WEI* 


82 


3 


A O 7 

Az/ 


U 


u,u u /o 


fiH 

UJ 


MD3.4 


92 


3 


L2/L16 


1 




HA 


MD3.2 


91 


3 


L6 


3 




HA 


VFR 


97 


3 


A27 


19 


22.40A) 


20 


CLL PATIENT 30 


78 


3 


L6 


3 


3.8% 


122 


M3.1N 


92 


3 


L2/116 


1 


1,3% 


54 


MD3.6 


91 


3 


L2/L16 


0 


0,0% 


54 


MD3.8 


91 


3 


L2/L1S 


0 


0,0% 


54 


GA3.4 


92 


3 


L6 


7 


9 t 0% 


54 


M3.GN 


92 


3 


A27 


0 


0,0% 


54 


MD3.10 


92 


3 


A27 


0 


0,0% 


54 

















SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2A: (continued) 



PCT/EP96/03647 



Name 1 


aa 2 


Computed 
family 3 


Germline 
gene* 


Diff. to 
germline 5 


o/o diff. to 
germline 6 


Reference 7 


MD3.13 


91 


J 


Az/ 


U 


U,U u /o 




MD3.7 


93 


3 


A27 


0 


0,0°/o 


TLA 


MD3.9 


93 


3 


A27 


0 


0,0% 


54 


GA3.1 


93 


3 


A27 


6 


7,6% 


54 


bkv32 


101 


3 


A27 


5 


5,7% 


13 


GA3.5 


93 


3 


A27 


5 


6,3% 


54 


GA3.7 


92 


3 


A27 


_1 


8,9% 


54 


MD3.12 


92 


3 


A27 


2 


2,5% 


54 


M3.2N 


90 


3 


L6 


6 


7,8% 


54 


MD3.5 


92 


3 


A27 


1 


1,3% 


54 


M3.4N 


91 


3 


L2/L16 


8 


10,3% 


54 


M3.8N 


91 


3 


L2/L16 


7 


9.0% 


54 


M3.7N 


92 


3 


A27 


3 


3,8% 


54 


GA3.2 


92 


3 


A27 


9 


11,4% 


54 


GA3.8 


93 


3 


A27 


4 


5,1% 


54 


GA3.3 


92 


3 


A27 


8 


10,1% 


54 


M3.3N 


92 


3 


A27 


5 


6,3% 


54 


66 


83 


3 


A27 


8 


11,3% 


78 


E29.1 KAPPA 


78 


3 


L2/L1 6 


0 


0,0% 


22 


sew 


108 


1 


08 


12 


12,6% 


31 


REI-based CAMPATH-9 


107 


1 


08 


14 


14,7% 


39 


RZ 


107 


1 


08 


14 


14,7% 


50 


Bl 


108 


1 


08 


14 


1 4,7% 


14 


AND 


107 


1 


02 


13 


13,7% 


69 


2A4 


109 


1 


02 


12 


12,6% 


23 


KA 


108 


1 


08 


19 


20,0% 


107 


MEV 


109 


1 


02 


14 


14,7% 


29 


DEE 


106 




02 


13 


14,0% 


76 


OU(IOC) 


108 




02 


18 


18,9% 


60 


HuRSVl9VK 


111 




08 


21 


21.0% 


115 


SP2 


108 




02 


17 


17,9% 


93 


BJ26 


99 




08 


21 


24,1 o/o 


1 


Nl 


112 




08 


24 


24,2% 


106 


BMA 0310EUCIV2 


106 




L12(1) 


21 


22,3% 


105 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 



PCT7EP96/03647 



Table 2A: (continued) 



Name 1 


aa 2 


Computed 
family 3 


Germline 
gene 4 


Diff. to 
germline 5 


% diff. to 
germline 


Referenc 


CLL PATIENT 6 


71 


1 
1 


AZU 


n 

u 


n 00/n 


122 


BJ19 


85 


1 


Uo 


1 D 


?1 QQfo 


1 


GM 607 


113 


2 


AJ 


u 




58 


R5A3K 


114 


2 


A") 

AJ 






125 


R1C8K 


114 


2 


AJ 


i 
i 


1 O0/n 


125 


VK2.R149 


113 


2 


A3 


Z 


Z f U*V0 


118 


TR1.6 


109 


2 


A3 


4 


4,U u /0 


Q? 


TR1.37 


104 


2 


A3 


5 


b.UHto 


3Z 


FS-1 


113 


2 


A3 


6 


b f U u /o 


R7 
of 


TR1.8 


110 


2 


A 1 

A3 


b 


D.LrTU 


92 


NIM 


113 


2 


AJ 


Q 

o 


R 00/n 


28 


Inc 


112 


2 


A O 

A3 


1 1 


1 1 no/n 


35 


TEW 


107 


2 


A T 
AJ 


b 


K AO/n 
D,*t*vU 


96 


CUM 


1 14 


*> 
2 


Hi 
U 1 


7 




44 


1 1 Of" 1 

HRF1 


7 1 
/ 1 


z 


r\0 


4 


5,6% 


124 


CLL PATItNl 19 


Q7 

0/ 


Z 


A7 


0 


0.0% 


122 


CLL PATIENT 20 


87 


2 


AO 

A3 


n 
U 


U,UtD 


1?2 


MIL 


112 


2 


A "> 

A3 


1 b 


i b.zyro 


ZD 


FR 


113 


2 


A3 


ZU 


zu,u u /o 


mi 


MAL-Urine 


83 


1 


U2 


b 


ft COJ^ 


in? 


Taykv306 


73 


3 


Az7 


l 


1 fiO/n 
1 ,D u /0 




Taykv3l2 


75 


3 


A O "J 

A27 


1 


1 CO/n 
1 ,D u /0 


oz 


HIV-b29 


93 


3 


A27 


1 4 


1 7 £0/r> 


Q 

o 


1-185-37 


1 10 


3 


A27 


u 


n no/n 


1 19 


1-187-29 


110 


*> 
3 


All 

Az7 


U 


n no/n 


1 19 


TT1 17 


110 


3 


AzV 


Q 


Q 40/n 




HIV-loop8 


108 


3 


AZ/ 


1 0 


1G RQ/ n 


8 


rsv23L 


108 


3 


A*57 

Az / 


1 c 


1 R RQ/h 

1 0,0 rtJ 


7 


HIV-b7 


107 


3 


A27 


14 


14,9% 


8 


HIV-bll 


107 


3 


A27 


15 


16,0% 


8 


HIV-LC1 


107 


3 


A27 


19 


20,2% 


8 


HIV-LC7 


107 


3 


A27 


20 


21,3% 


8 


HIV-LC22 


107 


3 


A27 


21 


22,3% 


8 


HIV-LC13 


107 


3 


A27 


21 


22,3% 


8 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2A: (continued) 



PCT/EP96/03647 



Name 1 


aa J 


Computed 
family 3 


Germline 
gene" 


Diff. to 
germline 5 


o/o diff. to 
germline 6 


Referenc 


HIV-LC3 


1U7 


J 


All 
AZ / 


z t 


ZZ, jtO 


a 

o 


HIV-LC5 


107 


3 


A27 


Z 1 


11 "JO//* 


O 

o 


HIV-LC28 


107 


3 


A27 


21 


zz\J% 


Q 
O 


HIV-b4 


107 


3 


A27 


22 


*>"> /in/*. 


O 
O 


CLL PATIENT 31 


87 


3 


A27 


1 5 


17,z% 


1 11 

I ZZ 


HIV-loop2 


108 


3 


L2/L16 


17 


i / t y u /o 


n 
O 


HIV-loop35 


108 


3 


L2/L16 


17 


1 7,9% 


o 

O 


HIV-LC1 1 


107 


3 


A27 


23 


24,5°/o 


n 
0 


HIV-LC24 


107 


3 


A27 


23 


24,5% 


O 

o 


HIV-b12 


107 


3 


A27 


24 


25 r 5°/o 


8 


HIV-LC25 


107 


3 


A27 


24 


25,5% 


8 


HIV-D21 


107 


3 


A27 


24 


25,5% 


8 


HIV-LC26 


107 


3 


A27 


26 


27,7% 


8 


G3D10K 


108 


1 


LI 2(2) 


12 


12,6% 


125 


TT125 


108 


1 


L5 


8 


8,4% 


63 


HIV-S2 


103 


3 


A27 


28 


31,1% 


8 


265-695 


108 


1 


L5 


7 


7,4% 


3 


2-115-19 


108 


1 


A30 


2 


2,1% 


1 19 


rsv13L 


107 


1 


02 


20 


21,1% 


7 


HIV-618 


106 


1 


02 


14 


15,1% 


8 


RF-KL5 


98 


3 


L6 


36 


36,7% 


97 


ZM1-1 


113 


2 


A17 


7 


7,0% 


3 


HIV-S8 


103 


1 


08 


16 


1 7,8% 


8 


K- EV15 


95 


5 


B2 


0 


0,0% 


1 12 


RF-TS3 


100 


2 


A23 


0 


0,0% 


121 


HF-21/28 


1 1 1 


2 


Ait 

A17 


1 


1 ,0% 


17 


RPMI6410 


113 


2 


Al 7 


1 


1 ,0% 


42 


JC11 


113 


2 


A17 


1 


1,0% 


49 


0-81 


114 


2 


A17 


5 


5,0% 


45 


FK-001 


113 


4 


B3 


0 


0,0% 


81 


CD5+.28 


101 


4 


B3 


1 


1,0% 


27 


LEN 


114 


4 


B3 


1 


1,0% 


104 


UC 


114 


4 


B3 


1 


1,0% 


in 


CD5+.5 


101 


4 


B3 


1 


1,0% 


27 



<r=L 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2A: (continued) 



PCT/EP96/03647 



Name 1 


aa 2 


Computed 
family 3 


Germline 
gene 4 


Diff. to 
germline 5 


<Vb diff. to 
germline 6 


Reference 7 


CD5+.26 


lOl 


A 

4 


R7 
D 0 


i 
i 


1 fWn 


27 


CD5+.12 


101 


yl 
4 


R9 
Do 


9 




27 


CD5+.23 


101 


yl 

4 


RO 


o 

z 


9 O0/n 
z ,u w /u 


27 


CD5+.7 


lOl 


4 


R9 
DO 


z 


9 OO/n 


27 


VJI 


1 i o 
1 1 0 


A 

4 


R9. 
DO 


O 
0 


T OQ/n 


56 


LOC 


1 1 J 


yl 
4 


R9 
Do 


O 
0 


J,v 


72 


MAL 


1 1 3 


4 


RO 
Do 


O 
0 


1 fi0/n 


7? 


CD5+.6 


101 


A 

4 


RT 
DO 


o 

0 




77 


H2F 


1 1 0 

1 1 0 


4 


RT 
DO 


o 




70 


PB 1 71V 


1 1 A 

1 1 4 


4 


RT 
Do 


/I 


4 DOM 


74 


CD5+.27 


i n 1 
101 


4 


R** 
DO 


/l 
*+ 


4 OO/n 


27 


CD5+.9 


lUl 


4 


R1 
Do 




4 OO/n 


27 


CD5-.28 


101 


4 


RT 
Do 


c 
D 




77 


CD5-.26 


inn 
101 


>* 
4 


R9 
Do 


C 
D 


O t 3TfO 


77 


CD5+.24 


101 


4 


RO 
DO 


C 
O 


QO/n 


77 


CD5+.10 


101 


4 


DO 
DO 


b 


O, J u /0 


97 

Z f 


CD5-.19 


101 


4 


DO 
DO 


b 




97 
Z / 


CD5-.18 


101 


4 


D O 
DO 


/ 




97 
Z / 


CD5-.16 


101 


4 


B3 


8 


/ r y u A) 


Z / 


CD5-.24 


101 


4 


B3 


o 

8 


7 nni- 

/,y u /o 


O 7 
Z / 


CD5-.17 


101 


4 


B3 


10 


y f y°/o 


07 

z / 


MD4.1 


92 


4 


" DO 

do 


0 


n not 


o4 


MD4.4 


92 


>• 
4 


RO 
DO 


U 


n no/n 


o*+ 


MD4.5 


92 


4 


B3 


0 


0,0% 


54 


MD4.6 


92 


4 


B3 


0 


0,0% 


54 


MD4.7 


92 


4 


B3 


0 


0,0% 


54 


MD4.2 


92 


4 


B3 


1 


1 f 30/o 


54 


MD4.3 


92 


4 


B3 


5 


6,3% 


54 


CLL PATIENT 22 


87 


2 


A17 


2 


2,3% 


122 


CLL PATIENT 23 


84 


2 


A17 


2 


2.4% 


122 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2B: rearranged human lambda sequences 



PCT/EP96/03647 



Name' 


aa J 


Computed 
family 3 


Germline 
gene 4 


Diff. to 
germline 5 


o/o diff. to 
germline 6 


Reference 7 


WAH 


no 


1 


DPL3 


7 


-in/ 

7% 


CQ 

bo 


1B9/F2 


1 12 


1 


DPL3 


7 


7% 


y 


DIA 


112 


1 


DPL2 


7 


7°/o 


Jb 


mAb67 


89 


1 


DPL3 


0 


0% 


23 


HM2 


1 10 


1 


DPL3 


12 


1 1% 


n 
J 


NI6-77 


1 12 


1 


DPL2 


9 


9% 


11 


OKA 


112 


1 


DPL2 


7 


7% 


84 


KOL 


112 


1 


DPL2 


12 


1 1% 


40 


T2:C5 


111 


1 


DPL5 


0 


0% 


6 


T2.C14 


110 


1 


DPL5 


0 


(Wo 


6 


PR-TS1 


110 


1 


DPL5 


0 


0°/o 


55 


4G12 


1 1 1 


1 


DPL5 


1 


10/0 


35 


KIM46L 


112 


1 


HUMLV1 17 


0 


0% 


8 


Fog-B 


111 


1 


DPL5 


3 


3% 


31 


9F2L 


111 


1 


DPL5 


3 


3% 


79 


mAblll 


110 


1 


DPL5 


3 


3% 


48 


PH0X15 


111 


1 


DPL5 


4 


4% 


49 


BL2 


111 


1 


DPL5 


4 


40/o 


74 


NIG-64 


111 


1 


DPL5 


4 


4% 


72 


RF-SJ2 


100 


. 1 


DPL5 


6 


6<Vo 


78 


AL EZI 


112 


1 


DPL5 


7 


70/0 


41 


ZIM 


112 


1 


HUMLV1 17 


7 


7% 


18 


RF-SJ1 


100 


'■ 


DPL5 


9 


9% 


78 


IGLV1.1 


98 


1 


DPL4 


0 


0°/o 


1 


NEW 


112 


1 


HUMLV1 17 


1 1 


10% 


42 


CB-201 


87 


1 


DPL2 


1 


l°/o 


62 


MEM 


109 


1 


DPL2 


6 


6% 


50 


H210 


111 


2 


DPL10 


4 


40/0 


45 


NOV 


110 


2 


DPL10 


8 


8% 


25 


NEI 


1 1 1 


2 


DPL10 


8 


8% 


24 


AL MC 


110 


2 


DPL11 


6 


6% 


28 


MES 


112 


2 


DPL11 


8 


8% 


84 


FOG 1 -A3 


111 


2 


DPL11 


9 


9% 


27 


AL NOV 


112 


2 


DPL11 


7 


70/n 


28 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2B: (continued) 



PCT/EP96/03647 



Name 1 


aa ? 


Computed 
family 3 


Germline 
gene* 


Diff. to °/o diff. to 
germline 5 germline 6 


Referem 


HMST-1 


110 


2 


DPLl 1 


4 


4% 


ol 


HBW4-1 


108 


2 


DPL12 


9 


9% 


CO 


WH 


110 


2 


DPLl 1 


11 


11% 


J4 


11-50 


110 


2 


DPLl 1 


7 


7% 


tsz 


HBp2 


110 


2 


DPL12 


8 


80/o 


J 


NIG-84 


113 


2 


DPL1 1 


12 


11% 


77 


ViL 


112 


2 


DPLl 1 


9 


9% 


CQ 


TRO 


1 1 1 


2 


DPLl 2 


10 


10% 


0 1 


ES492 


108 


2 


DPL1 1 


15 


15% 


/ D 


mAb216 


89 


2 


DPL12 


1 


10/0 


/ 


BSA3 


109 


3 


DPL16 


0 


0% 


4y 


THY-29 


1 10 


3 


DPL1G 


0 


- - Wo 


27 


PR-TS2 


108 


3 


DPLl 6 


0 


0% 


cc 
bb 


E29.1 LAMBDA 


107 


3 


DPLl 6 


1 


1% 


1 1 
1 J 


mAbG3 


109 


3 


DPL16 


2 


2% 


9Q 

zy 


TEL 14 


110 


3 


DPLl 8 


6 


6% 




6H-3C4 


108 


3 


DPLl 6 


7 


70/o 




SH 


109 


3 


DPL1G 


7 


7% 


70 


ALGIL 


109 


3 


DPL1G 


8 


8% 


23 


H6-3C4 


108 


3 


DPL1G 


8 


80/o 


83 


V-lambda-2.DS 


111 


2 


DPL11 


3 


3o/o 


15 


8.12 ID 


110 


2 


DPL11 


3 


3% 


81 


DSC 


111 


2 


DPLl 1 


3 


3% 


56 


PV11 


110 


2 


DPLl 1 


1 


10/0 


56 


33.H11 


110 


2 


DPLl 1 


4 


40A) 


0 1 
81 


AS17 


1 1 1 


2 


DPL1 1 


7 


70/b 


bb 


SD6 


1 10 


2 


DPLl 1 


7 


70/0 


cc 

DO 


K53 


110 


2 


DPLl 1 


9 


9% 


5b 


PV6 


110 


2 


DPL12 


5 


5% 


56 


NGD9 


110 


2 


DPLl 1 


7 


70/0 


56 


MUCl-1 


111 


2 


DPLl 1 


11 


10% 


27 


A30c 




2 


DPL10 


6 


60/0 


56 


KS6 


110 


2 


DPLl 2 


6 


GO/o 


56 


TEL13 


111 


2 


DPLl 1 


11 


10% 


49 



4> 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 



PCT/EP96/03647 



Table 2B: 



(continued) 



MQITK*' 
INdlilC 


aa 2 


Pnmnurpd 

l ail ill y 


Gprmlinp 

\j nun i^. 

UVrllt. 


Diff. to % diff. to 
germline 5 germline 6 


Reference 7 


AS7 


no 


2 


DPI! 2 


6 


6% 


56 


MCG 


112 


2 


DPL12 


12 


11% 


20 


U266L 


110 


2 


DPL12 


13 


1 2% 


77 


PR-SJ2 


110 


2 


DPL12 


14 




55 


BOH 


112 


2 


DPL12 






37 


TOG 


111 


2 


DPL11 


1 Q 


1 flO/n 

1 0*70 


53 


TEL 16 


111 


2 


DPL11 


1Q 


1 RO/n 


49 


No. 13 


110 


2 


DPL10 


1 A 




52 


BO 


112 


2 


DPL12 


1 Q 
1 O 


1 / u /o 


80 


WIN 


112 


2 


DPL12 


1 / 




11 


BUR 


104 


2 


DPL12 


1 5 


1 5% 


46 


NIG-58 


110 


2 


DPL12 




1 Qft/~ 


69 


WEIR 


112 


2 


DPL1 1 




zo u /o 


21 


THY-32 


111 


1 


DPL8 


p 


roa* 


27 


TNF-H9G1 


111 


1 


DPL8 


Q 




27 


mAb6l 


111 


1 


DPL3 


1 


l u A) 


29 


LV1L1 


98 


1 


DPL2 


n 
u 


U*tO 


54 


HA 


113 


1 


DPL3 


1 A 


1 OVfO 


63 


LAI LI 


111 


1 


DPL2 


J 


J u /0 


54 


RHE 


112 


1 


DPL1 


i 7 


1 CQ/a 

1 D u /0 


22 


K1B12L 


113 


1 


• DPL8 


1 7 


1 D u /0 


79 


LOC 


113 


1 


DPL2 


1 D 


1 AO/n 


84 


NIG-51 


112 


1 


DPL2 


1 ? 
1 Z 


1 10/n 


67 


NEWM 


104 


1 


DPL8 




??0/n 


10 


MD3-4 


106 




DPL23 


14 


13% 


4 


COX 


112 


1 


DPL2 


13 


12% 


84 


HiHlO 


106 




DPL23 


13 


12% 


3 


VOR 


112 




DPL2 


16 


15% 


16 


ALPOL 


113 




DPL2 • 


16 


15% 


57 


CD4-74 


111 




DPL2 


19 


18% 


27 


AMYLOID MOL 


102 


3 


DPL23 


15 


15% 


30 


OST577 


108 


3 


Humlv318 


10 


10% 


4 


NIG-48 


113 


1 


DPL3 


42 


40% 


66 


CARR 


108 


3 


DPL23 


18 


17% 


19 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2B: (continued) 



PCT/EP96/03647 



Name 1 


aa 2 


Computed 


Germline 


Diff. to 


% diff. to 


Reference 7 




family 3 


gene 4 


germline 5 


germline 6 




mAbbO 


IUO 


l 


DPL23 


14 


13°/o 


29 


NIG-68 


yy 




DPI 91 


25 


26% 


32 


KERN 


1U/ 




npi 71 


76 


25% 


59 


ANT 


lUb 


J 


npi 71 




16% 


19 


LEE 


1 1 A 

1 1U 


•5 


npi 7i 


18 


17% 


85 


CLE 


y4 


J 


npi 71 


1 7 
i / 


17% 


19 


VL8 


Qft 


Q 

o 


DPL21 


o 


0% 


81 


MOT 


1 1fi 


1 


Humlvllfi 

1 1 U 1 1 1 1 V J 1 CI 


23 


22% 


38 


GAR 


1 wo 


1 


DPL23 


26 


25% 


33 


32.B9 


DO 


Q 

o 


DPL21 


5 


5% 


81 


PUb 


ma 


1 


Humlv3l8 


24 


23% 


19 


T1 


1 1 ^ 


Q 
o 


HUMLV801 


52 


50% 


6 


RF-TS7 


QC 

3D 


7 


DPI 1R 

L/l LtO 


4 


4% 


60 


YM-1 


1 1 b 


Q 
O 


nu ivilvovj i 


SI 


49% 


75 


K6HG 


1 1 L 


Q 
O 


HI IMIVR01 
nuiviLvou i 


20 

£-\J 


19% 


44 


K5C7 


1 1 z 


Q 
O 


hi iiv/ii v/am 




19% 


44 


K5B8 


119 
1 iz 


O 
0 


H11MIVR01 


20 


19% 


44 


K565 


1 1 Z 


O 
O 


H11M1VR01 

nUIVILvOV/ 1 


20 


19% 


44 


1/ A D O 

K4B8 


1 1 z 


Q 
O 


HUMLV801 

1 1 \J IVI L V 1 


19 


18% 


44 


Kbr5 


1 1 7 
1 1 z 


ft 

o 


HUMLV801 


17 


16% 


44 


tut 
H1L 


inn 


1 


DPI 91 


22 


21% 


47 


KIR 


i no 
i uy 


J 


npi 77 

Ur LZ J 


70 

z v/ 


19% 


19 


CAP 


1 HQ 


1 


npi 7i 

ur lz J 


13 


18% 


84 


1B8 


1 1U 


J 


npi 7T 


77 
zz 


71 Ofo 


43 




1 uo 




DPI 71 
L/r lzo 


19 


18% 


19 


HAN 


1 HQ 

lUo 


O 


HPI 71 
UrLZ J 


70 

zu 


19% 


. 19 




QR 




DPL23 


3 


3% 


12 


PR-SJ1 


96 


3 


DPL23 


7 


7% 


55 


BAU 


107 


3 


DPL23 


9 


9% 


5 


TEX 


99 


3 


DPL23 


8 


8% 


19 


X(PET) 


107 


3 


DPL23 


9 


9% 


51 


DOY 


106 


3 


DPL23 


9 


9% 


19 


COT 


106 


3 


DPL23 


13 


12% 


19 


Pag-1 


111 


3 


Humlv318 


5 


5% 


31 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2B: (continued) 



PCT/EP96/03647 



Name 1 


aa 2 


ComDuted 
family 3 


Germline 
gene 4 


Diff. to 
germline 5 


°A> diff to 
germline 6 


Reference 7 


DIS 


107 


3 


Hum!v318 


2 


20/o 


19 


WIT 


108 


3 


Humlv318 


7 


7% 


19 


I.RH 


108 


3 


Humlv318 


12 


lio/o 


19 


S1-1 


108 


3 


Humlv318 


12 


lio/o 


52 


DEL 


108 


3 


Humlv318 


14 


13% 


17 


TYR 


108 


3 


Humlv318 


11 


10% 


19 


J.RH 


109 


3 


Humlv318 


13 


12% 


19 


THO 


112 


2 


DPL13 


38 


36o/o 


26 


LBV 


113 


1 


DPL3 


38 


360/o 


2 


WLT 


112 


1 


DPL3 


33 


31% 


14 


SUT 


112 


2 


DPL12 


37 


350/o 


65 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2C: rearranged human heavy chain sequences 



PCT/EP96/03647 



Name' 


dd 


I dim iy 


(iPrmlinp 

vtl 11111*1^. 

npnp* 


Diff. to % diff. to 
germline 5 germline* 


Reference' 


21/28 


119 


1 


VH1-13-12 


0 


0,0% 


31 


8E10 


123 


1 


VHU13-12 


0 


0.0% 


31 


MUCl-1 


118 


1 


VH 1-1 3-6 


4 


4,1% 


42 


gFi 


98 


1 


VH1-13-12 


10 


10,2% 


75 


VHGL 1.2 


98 


1 


VH1-13-6 


2 


2,0% 


26 


HV1L1 


98 


1 


VH1-13-6 


0 


0,0% 


81 


RF-TS7 


104 


1 


VH 1-1 3-6 


3 


3,1% 


96 


E55 1.A15 


106 


1 


VH1-13-15 


1 


1,0% 


26 


HA1L1 


126 


1 


VH1-13-6 


7 


7.1% 


81 


UC 


123 


1 


VH 1-1 3-6 


5 


5.1% 


115 


WIL2 


123 


1 


VH1-13-6 


6 


6,1% 


55 


R3.5H5G 


122 


1 


VH1-13-6 


10 


10.2% 


70 


N89P2 


123 


1 


VH1-13-16 


1 1 


1 1 .2% 


77 


mAbl13 


126 


1 


VH1-13-6 


10 


10,2% 


71 


LS2S3-3 


125 


1 


VH1-12-7 


5 


5.1% 


98 


LS2S3-12a 


125 


1 


VH1-12-7 


5 


5.1% 


98 


LS2S3-5 


125 


1 


VH1-12-7 


5 


5.1% 


98 


LS2S3-12e 


125 


1 


VH1-12-7 


5 


5.1% 


98 


LS2S3-4 


125 


1 


VH1-12-7 


5 


5.1% 


98 


LS2S3-10 


125 


1 


VH1-12-7 


5 


5.1% 


98 


LS253-l2d 


125 


1 


VH1-12-7 


6 


6.1% 


98 


LS2S3-8 


125 


1 


VH1-12-7 


5 


5.1% 


98 


LS2 


125 


1 


VH1-12-7 


6 


6.1% 


113 


L54 


105 


1 


VH1-12-7 


6 


6.1% 


113 


LS5 


125 


1 


VH1-12-7 


6 


6.1% 


113 


LSI 


125 


1 


VH1-12-7 


6 


6.1% 


113 


LS6 


125 


1 


VH1-12-7 


6 


6.1% 


113 


LS8 


125 


1 


VH1-12-7 


7 


7.1% 


113 


THY- 29 


122 


1 


VH1-12-7 


0 


0.0% 


42 


1B9/F2 


122 




VH1-12-7 


10 


10.2% 


21 


51P1 


122 




VH1-12-1 


0 


0.0% 


105 


NEI 


127 




VH1-12-1 


0 


0.0% 


55 


AND 


127 




VH1-12-1 


0 


0.0% 


55 


L7 


127 




VH1-12-1 


0 


0.0% 


54 


122 


124 




VH1-12-1 


0 


0.0% 


54 


L24 


127 




VH1-12-1 


0 


0.0% 


54 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2C: (continued) 



PCT7EP96/03647 



Name' aa 2 Computed Germiine Diff. to °/o diff. to Reference 7 

family 3 gene 4 g^mline 5 germiine 6 



126 


116 


1 VH1 


- 


12-1 


0 


0.0% 


54 


□3 


119 


1 VH1 


- 


12-1 


0 


0.0% 


54 


L34 


117 


1 VH1 


1- 


12-1 


0 


0.0% 


54 


L36 


118 


1 VH1 


1- 


12-1 


0 


0.0% 


54 


L39 


120 


1 VH1 


- 


12-1 


0 


0.0% 


54 


141 


120 


1 VH1 


- 


12-1 


0 


0.0% 


54 


L42 


125 


1 VH1 


- 


12-1 


0 


0.0% 


54 


VHGL 1.8 


101 


1 VH1 


- 


12-1 


0 


0.0% 


26 


783c 


127 


1 VH1 


- 


12-1 


0 


0.0% 


22 


X17115 


127 


1 VH1 


- 


12-1 


0 


0.0% 


37 


125 


124 


1 VH1 


- 


12-1 


0 


0.0% 


54 


L17 


120 


1 VH1 


- 


12-1 


1 


1.0% 


54 


L30 


127 


1 VH1 


- 


12-1 


1 


1.0% 


54 


L37 


120 


1 VH1 


- 


12-1 


1 


1.0% 


54 


TNF-E7 


116 


1 VH1 


1- 


12-1 


2 


2.0% 


42 


mAblll 


122 


1 VH1 


- 


12-1 


7 


7.1°/b 


71 


III-2R 


122 


1 VH1 


- 


12-9 


3 


3.1% 


70 


KAS 


121 


1 VH1 


- 


12-1 


7 


7.10/0 


79 


YES8c 


122 


1 VH1 


- 


12-1 


8 


8.2% 


34 


RF-TS1 


123 


1 VH1 


- 


12-1 


8 


8,2% 


82 


BOR' 


121 


1 VH1 


- 


12-8 


7 


7,1% 


79 


VHGL 1.9 


101 


1 • VH1 


- 


12-1 


8 


8,2% 


26 


mAb410.30F305 


117 


1 VH1 


- 


12-9 


5 


5.1% 


52 


EV1-15 


127 


1 VH1 


-' 


12-8 


10 


10.2% 


78 


mAb112 


122 


1 VH1 


-' 


12-1 


11 


11.2% 


71 


EU 


117 


1 VH1 


-' 


12-1 


11 


11,2% 


28 


H210 


127 


1 VH1 




12-1 


12 


12.2% 


66 


TRANSGENE 


104 


1 VH1 




12-1 


0 


0,0% 


111 


CLL2-1 


93 


1 VH1 




12-1 


0 


0.0% 


30 


CLL10 13-3 


97 


1 VH1 




12-1 


0 


0,0% 


29 


LS7 


99 


1 VH1 




12-7 


4 


4,1% 


113 


ALL7-1 


87 


1 VH1 




12-7 


0 


0,0% 


30 


CLL3-1 


91 


1 VH1 




12-7 


1 


1,0% 


30 


ALL56-1 


85 


1 VH1 




13-8 


0 


0,0% 


30 


ALL1-1 


87 


1 VHl 




13-6 


1 


1,0% 


30 


ALL4-1 


94 


1 VH1 




13-8 


0 


0,0% 


30 



7*° 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 
Table 2C: (continued) 



PCT7EP96/03647 



Name' 


aa 2 


Computed 


Gerrntine 


Diff. to 


% diff. to 


Referenci 




family 1 


gene 4 


germlme germlme 




ALL56 1 5-4 


85 


1 


1 /I J a « *\ O 

VH1- 13-8 


5 


5,1% 


79 


CLL4-1 


88 


1 


VH1-13-1 


1 


1,0% 




Au92.1 


98 


1 


VH1-12-5 


o 


0,0% 




RF-TS3 


120 


1 


VH1-12-5 


1 


1,0% 


Q7 


Au4.1 


98 


1 


VH1-12-5 


1 


1,0% 




HP1 


121 


' 


VHl-13-6 


13 


13,3% 


i in 


BLI 


127 


1 


VH1-13-15 


5 


5,1% 


77 


No. 13 


127 


• 1 


VH 1-1 2-2 


19 


1 9,4% 


/ O 


TR1.23 


122 


1 


VHl-13-2 


23 


23,5% 


QQ 
DO 


Sl-1 


125 


1 


VH1-12-2 


1R 


18,4% 


/ b 


TR1.10 


119 


1 


VH1-13-12 


1 4 


14,3% 




E55 1.A2 


102 


1 


VH1-13-15 


7 


3,1% 




SP2 


119 


1 


VHl-13-6 


1 C 
i J 


1 5,3% 




TNF-H9G1 


m 


1 


VH1-13-18 


7 


2.0% 


42 


G3D10H 


127 


1 


VH1-13-16 


19 


1 9,4% 


1 27 


TR1.9 


118 


1 


VH1-13-12 


14 


14,3% 


QQ 
OO 


TR1.8 


121 


1 


VH1-12-1 


74 


24,5% 


QQ 
OO 


LUNmOl 


127 


1 


VHl-13-6 


77 


77 4% 


Q 


K1B12H 


127 


1 


VH 1-1 2-7 


71 


23,5% 


177 
1 Li 


L3B2 


99 


1 


VHl-13-6 


0 


7 0% 


A C 

4o 


ss2 


100 


1 


VHl-13-6 


o 


7 0% 


A H 

46 


No.86 


124 


1 


VH1-12-1 


70 


70 40/n 


76 


TR1.6 


124 


1 


VH1-12-1 


1 Q 


19 4% 


88 


SS7 


99 


1 


VH1-12-7 


*> 
J 




46 


S5B7 


102 


1 


VH1-12-1 


n 
U 


n no/n 


46 


S6A3 


97 


1 


VH1-12-1 


n 


0,0% 


46 


ss6 


99 


1 


VHl-12-1 


0 


0,0% 


46 


L2H7 


103 


1 


VH1-13-12 


0 


0.0% 


46 


S6B68 


93 




VH 1-13-12 


0 


0.0% 


A C 
HO 


S6C9 


107 




VH1-13-12 


0 


0,0% 


46 


Ul\/ hA 


124 


; 


VH1-13-12 


21 


21,4% 


12 


HIV-b12 


124 




VH1-13-12 


21 


21,4% 


12 


L3G5 


98 




VHl-13-6 


1 


1,0% 


46 


22 


115 




VHl-13-6 


11 


1 1,20/o 


118 


L2A12 


99 




VH1-13-15 


3 


3,1% 


46 


PH0X15 


124 




VH1-12-7 


20 


20.4% 


73 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 . 
Table 2C: (continued) 



PCT/EP96/03647 



Name 


aa 2 


Computed 


Germline 


Diff. to 


% diff. to 


Referen 






family 3 


gene 4 


nprrrilinf*^ 
y iiiiif it 






LUNm03 


127 


1 


VH1-1X-1 


18 


18,4% 


9 


CEA4-8A 


129 


1 


VH1-12-7 


1 


l,0°/o 


42 


M60 


121 


2 


VH2-31-3 


3 


3.0% 


103 


HiHIO 


127 


2 


VH2-31-5 


9 


9,0% 


4 


COR 


119 


2 


VH2-31-2 


1 1 


1 1,0% 


91 


2-115-19 


124 


2 


VH2-31-11 


8 


8,1% 


124 


OU 


125 


2 


VH2-31-14 


20 


25,6% 


92 


HE 


120 


2 


VH2-31-13 


13 


1 Q OO/n 


77 


CLL33 40-1 


78 


2 


VH2-31-^ 


7 


7 OO/n 


9Q 
Z j 


E55 3.9 


88 


3 


VH3-1 1-S 

VI IJ 1 I J 


7 


7 70Jn 


ZD 


MTFC3 


125 


3 


VH3-14-4 


71 


7 1 oo/a 


1 7 1 


MTFCl 1 


125 


3 


VH3-14-4 


21 


71 OO/n 




MTFJl 


114 


3 


VH3-14-4 


71 

Z. I 




in 


MTFJ2 


1 14 


3 


VH3-14-4 


21 


71 OO/n 


1 O I 


MTFUJ4 


100 


3 


VH3-14-4 


21 


7 1 OO/n 


111 


MTFUJ5 


100 


3 


VH3-14-4 


21 


7 1 OO/n 


111 


MTFUJ2 


100 


3 


VH3-14-4 


22 


77 OO/n 




MTFC8 


125 


3 


VH3-14-4 




77 no/ft 




TD eVq 


1 13 


3 


VH3-14-4 


0 


o noi/n 


1 D 


rMTF 


1 14 


3 


VH1-14-4 

VI IJ | "T *T 


c 

D 


^ ho/a 


1 *> 1 


MTFUJ6 


100 


3 


VH3-14-4 


10 


i n oo/a 


I J t 


RF-KES 


107 


3 


• VH3-14-4 


9 




Oj 


N51P8 


126 


3 


VH3-14-1 


q 


Q OO/n 


77 


TEI 


119 


3 


VH3-13-8 


71 


71 40/n 


70 


33.H11 


115 


3 


VH3-13-19 


10 


1 0 70/n 


17Q 


SB1/D8 


101 


3 


VH3-1X-8 


14 


14 fWn 


7 


38P1 


119 


3 


VH3-1 1-3 


o 




104 


BRO'IGM 


1 19 


3 


VH3-1 1-3 


■ j 




1Q 


NIE 


119 


3 


VH3-13-7 


15 


15,3% 


87 


306 


126 


3 


VH3-13-26 


5 


J, 1 'U 


jj 


ZMl-1 


112 


3 


VH3-11-3 


8 


8,2% 


5 


E55 3.15 


110 


3 


VH3-13-26 


0 


0,0% 


26 


gF9 


108 


3 


VH3-13-8 


15 


1 5,3% 


75 


THY-32 


120 


3 


VH3-13-26 


3 


3,1% 


42 


RF-KL5 


100. 


3 


VH3-13-26 


5 


5,1% 


96 


OST577 


122 


3 


VH3-13-13 


6 


6,10/0 


5 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 
Table 2C: (continued) 



PCT/EP96/03647 



Name' 


aa J 


Computed 
family 3 


Germline 
gene 4 


Diff. to % diff. to 
germline* germline 6 


Referenci 


BO 


113 


3 


Wt_J*> 1 "> 1 Q 


1 s 


1 5 f 3% 


10 


TT125 


121 


3 


VH3-13-10 


15 


1 5,3% 


fid 


2-115-58 


127 


3 


VH3-13-10 


1 1 


1 1 ,2% 


t O A 

124 


KOL 


126 


3 


VH3-13-14 


16 


1 6,3% 


102 


mAb60 


118 


3 


VH3-13-17 


14 


14,3% 


A C 

4b 


RF-AN 


106 


3 


VH3-13-26 


8 


8.2% 


Or 

85 


BUT 


115 


3 


VH3-11-6 


13 


13,4% 


1 19 


KOL-based CAM PATH- 














9 


118 


3 


VH3-13-13 


16 


16.3%- 




B1 


119 


3 


VH3-13-19 


13 


13.3% 




N98P1 


127 


3 


VH3-13-1 


13 


13.3% 


77 


TT117 


107 


3 


VH3-13-10 


12 


1 2.2% 




WEA 


114 


3 


VH3-13-12 


1 ^ 


1 5.3% 


a n 
4U 


HIL 


120 


3 


VH3-13-14 


14 


1 4,3% 


Z J 


S5A10 


97 


3 


VH3-13-14 


o 


0,0% 


4b 


S5D11 


98 


3 


VH3-13-7 


n 


OOQfo 


46 


SGC8 


100 


3 


VH3-13-7 


n 


OOQ/b 

\J,\J f\J 


46 


S6H12 


98 


3 


VH3-13-7 


u 




A £* 

46 


VHI0.7 


119 


3 


VH3-13-14 


1 D 


1 6,3% 


1 *>Q 

1 zo 


HIV-loop2 


126 


3 


VH3-13-7 


lG 


1 6,3% 


I Z 


HIV-loop35 


126 


3 


VH3-13-7 


16 


16,3% 


1 Z 


TRO 


122 


3 


VH3-13-1 


13 


13,3% 


61 


SA-4B 


123 


3 


VH3-13-1 


1 ^ 


1 5.3% 


125 


L2B5 


98 


3 


VH3-13-13 


n 
\j 


0,0% 


A H 

46 


S6E11 


95 


3 


VH3-13-13 


n 




46 


S6H7 


100 


3 


VH3-13-13 




0.0% 


46 


SSI 


102 


3 


VH3-13-13 


n 


0.0% 


A P" 

46 


ss8 


94 


3 


VH3-13-13 


0 


0.0% 


A C 

46 


DOB 


120 


3 


VH3-13-26 


21 


21,4% 


1 lb 


THY-33 


115 


3 


VH3-13-15 


20 


20.4% 


A O 

4z 


NOV 


118 


3 


VH3-13-19 


14 


14,3% 


38 


rsvl3H 


120 


3 


VH3-13-24 


20 


20,4% 


11 


L3G11 


98 


3 


VH3-13-20 


2 


2.0% 


46 


L2E8 


99 


3 


VH3-13-19 


0 


0.0% 


46 


L2D10 


101 


3 


VH3-13-10 


1 


1,0% 


46 


L2E7 


98 


3 


VH3-13-10 


1 


1.0% 


46 



73 

SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2C: (continued) 



PCT/EP96/03647 



» 1 1 

Name 


7 

aa 2 


Computed Germline 
family 5 gene 4 


Diff. to % diff. to 
germline* germline 6 


Referene< 


L3A10 


100 


3 


VH3-13-24 


0 


0.0% 


46 


L2E5 


97 


3 


VH3-13-2 


1 


1.0% 


46 


BUR 


119 


3 


VH3-13-7 


21 


21,4% 


67 


S4D5 


107 


3 


VH3-11-3 


1 


1.0% 


46 


19 


116 


3 


VH3-13-16 


4 


4.1% 


118 


S5D4 


99 


3 


VH3-13-1 


0 


0.0% 


46 


S6A8 


100 


3 


VH3-13-1 


0 


0.0% 


46 


HIV-loopl3 


123 


3 


VH3-13-12 


17 


17.3% 


12 

I Z. 


TR1.32 


112 


3 


VH3-11-8 


18 


18.6% 


oo 


L2B10 


97 


3 


VH3-11-3 


1 


1.0% 


4b 


TR1.5 


1 14 


3 


VH3-11-8 


21 


21,6% 


oo 
oo 


s6H9 


101 


3 


VH3- 13-25 


0 


0.0% 


4b 


8 


1 1 2 


3 


VH3-13-1 


6 


6.1% 


i to 

1 1 8 


23 


1 \ 5 

* l 0 


3 


VH3-1.3-1 


6 


6.10/0 


1 18 


7 


1 1 5 


3 


VH3-13-1 


4 


4.1% 


1 18 


TR1.3 




3 


VH3-11-8 


20 


20,6% 


88 


18/2 


125 


3 


VH3- 13-10 


0 


0,0% 


32 


18/9 


125 


3 


VH3-13-10 


0 


0.0% 


31 


30P1 


119 


3 


VH3-13-10 


0 


0,0% 


106 


HF2-1/17 


17R 


3 


VH3-13-10 


0 


0.0% 


8 


A77 


109 


3 


VH3-13-10 


0 


0.0% 


44 


B19.7 


108 


3 


' VH3-13-10 


0 


0.0% 


44 


M43 


119 


3 


VH3-13-10 


0 


0,0% 


103 


1/17 


125 


3 


VH3-13-10 


0 


0,0% 


O 1 

S\ 


18/17 


125 


3 


VH3-13-10 


0 


0,0% 


Jl 


E54 3.4 


109 


3 


VH3-13-10 


0 


0,0% 


ZD 


LAMBDA-VH26 


98 


3 


VH3-13-10 


1 


1,0% 




E54 3.8 


111 


3 


VH3-13-10 


1 


1,0% 


ZD 


GL16 


106 


3 


VH3-13-10 


1 


1.0% 


*T*T 


4G12 


125 


3 


VH3-13-10 


1 


1,0% 


. 56 


A73 


106 


3 


VH3-13-10 


2 


2,0% 


44 


AL1.3 


111 


3 


VH3-13-10 


3 


3,1% 


117 


3.A290 


118 


3 


VH3-13-10 


2 


2,0% 


108 


Ab18 


127 


3 


VH3-13-8 


2 


2,0% 


100 


E54 3.3 


105 


3 


VH3-13-10 


3 


3,1% 


26 


35G6 


121 


3 


VH3-13-10 


3 


3,1o/o 


57 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 
Table 2C: (continued) 



PCT/EP96/03647 



IN all It- 


aa 2 


Computed 
family 3 


Germline 
gene 4 


Diff. to 
germline 5 


% diff. to 
germline* 


Reference' 


A95 


107 


3 


VH3-13-10 


5 


5,1% 


44 


Ab25 


128 


3 


VH3-13-10 


5 


5,1% 


100 


N87 


12G 


3 


VH3-13-10 


4 


4.1% 


77 


ED8.4 


99 


3 


VH3-13-10 


6 


6,1% 


2 


RF-KL1 


122 


3 


VH3-13-10 


6 


6,1% 


82 


ALU 


112 


3 


VH3-13-10 


2 


2,0% 


1 17 


AL3.11 


102 


3 


VH3-13-10 


1 


1.0% 


1 17 


32.B9 


127 


3 


VH3-13-8 


G 


6,1% 


129 — 


TK1 


109 


3 


VH3-13-10 


2 


2,0% 


1 17 


POP 


123 


3 


VH3-13-10 


8 


8.2% 


115 


9F2H 


127 


3 


VH3-13-10 


9 


9.2% 


127 


VD 


115 


3 


VH3-13-10 


9 


9.2% 


10 


Vh38CI.10 


121 


3 


VH3-13-10 


8 


8,2% 


74 


Vh38CI.9 


121 


3 


VH3-13-10 


8 


8.2% 


74 


Vh38C1.8 


121 


3 


VH3-13-10 


8 


8.2% 


74 


G3P1 


120 


3 


VH3-11-8 


0 


0.0% 


104 


G0P2 


117 


3 


VH3-11-8 


0 


0.0% 


104 


AL3.5 


90 


3 


VH3-13-10 


■2 


2.0% 


117 


GF4/1.1 


123 


3 


VH3-13-10 


10 


10.2% 


39 


Ab21 


126 


3 


VH3-13-10 


12 


12.2% 


100 


TDd Vp 


118 


3 


VH3-13-17 


2 


2.t)o/o 


16 


Vh38CI.4 


119 


3 


VH3-13-10 


8 


8.2% 


74 


VH38CI.5 


119 


3 


VH3-13-10 


8 


8.2% 


74 


AL3.4 


104 


3 


VH3-13-10 


1 


1.0% 


117 


FOG 1 -A3 


115 


3 


VH3-13-19 


2 


2.0% 


42. 


HA3D1 


117 


3 


VH3-13-21 


1 


1.0% 


81 


E54 3.2 


112 


3 


VH3-13-24 


0 


0.0% 


26 


mAb52 


128 


3 


VH3-13-12 


2 


2.0% 


51 


mAb53 


128 


3 


VH3-13-12 


2 


2.0% 


51 


mAb56 


128 


3 


VH3-13-12 


2 


2.0% 


51 


mAb57 


128 


3 


VH3-13-12 


2 


2.0% 


51 


mAb58 


128 


3 


VH3-13-12 


2 


2.0% 


51 


mAb59 


128 


3 


VH3-13-12 


2 


2.0% 


51 


mAb105 


128 


3 


VH3-13-12 


2 


2.0% 


51 


mAb107 


128 


3 


VH3-13-12 


2 


2.00/0 


51 


E55 3.14 


110 


3 


VH3-13-19 


0 


0.00/b 


26 



7^ 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2C: (continued) 



PCT/EP96/03647 



Name 


2 

as 


Computed 
family 3 


Germline 
gene 4 


Diff. to % diff. to 
germline 5 germline 6 


nerercm 


F13-28 


106 


3 


VH3-13-19 


1 


1,0% 


94 


mAb55 


127 


3 


VH3-13-18 


4 


4,1% 


51 


YSE 


117 


3 


VH3- 13-24 


6 


6,1% 


72 


E55 3.23 


106 


3 


VH3-13-19 


2 


2,0% 


26 


RF-TS5 


101 


3 


VH3-13-1 


3 


3.1% 


85 


N42P5 


124 


3 


VH3-13-2 


7 


7.1% 


77 


FOG1-H6 


110 


3 


VH3-13-16 


7 


7,1% 


42 


0-81 


115 


3 


VH3-13-19 


11 


11.2% 


47 


HIV-S8 


122 


3 


VH3-13-12 


11 


11.2% 


12 


mAb114 


125 


3 


VH3-13-19 


12 


12.2% 


71 


33.F1 2 


116 


3 


VH3-13-2 


4 


4.1% 


129 


484 


119 


3 


VH3-1X-3 


0 


0.0% 


101 


M26 


123 


3 


VH3-1X-3 


0 


0.0% 


103 


VHGL 3.1 


100 


3 


VH3-1X-3 


0 


0.0% 


26 


E55 3.13 


113 


3 


VH3-1X-3 


1 


1.0% 


26 


SB5/D6 


101 


3 


VH3-1X-6 


3 


3.0% 


2 


RAY4 


101 


3 


VH3-1X-6 


3 


3.0% 


2 


82-D V-D 


106 


3 


VH3-1X-3 


5 


5.0% 


112 


MAL 


129 


3 


VH3-1X-3 


5 


5.0% 


72 


LOC 


123 


3 


VH3-1X-6 


5 


5,0% 


72 


L5F2 


101 


3 


VH3-1X-6 


11 


11,0% 


2 


HIB RC3 


100 


3 


• VH3-1X-6 


11 


11,0% 


1 


56P1 


119 


3 


VH3-13-7 


0 


0.0% 


104 


M72 


122 


3 


VH3-13-7 


0 


0.0% 


103 


M74 


121 


3 


VH3-13-7 


0 


0,0% 


103 


E54 3.5 


105 


3 


VH3-13-7 


0 


0.0% 


26 


2E7 


123 


3 


VH3-13-7 


0 


0.0% 


63 


2P1 


117 


3 


VH3-13-7 




n no/n 


104 


RF-SJ2 


127 


3 


VH3-13-7 


1 


1,0% 


83 


PR-TSl 


114 


3 


VH3-13-7 


1 


1.0% 


85 


KIM46H 


127 


3 


VH3-13-13 


0 


0,0% 


18 


E55 3.6 


108 


3 


VH3-13-7 


2 


2.0% 


26 


E55 3.10 


107 


3 


VH3-13-13 


1 


1,0% 


26 


3.BG 


114 


3 


VH3-13-13 


1 


1,0% 


108 


E54 3.6 


110 


3 


VH3-13-13 


1 


1,0% 


26 


FL2-2 


114 


3 


VH3-13-13 


1 


1 ,0% 


80 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 
Table 2C: (continued) 



PCT/EP96/03647 



Name 


3d 


cumpuicu 


flprmlinp 

filling 


Diff. to 


% diff. to 
germline' 


Reference 




rarniiy 


ycnc 


germline 5 




RF-SJ3 


112 


3 


VH3-13-7 


2 


2.0% 


85 


E55 3.5 


105 


3 


VH3-13-14 


1 


1,0% 


26 


BSA3 


121 


3 


VH3-13-13 


1 


1,0% 


73 


HMST-1 


119 


3 


VH3-13-7 


3 


3,1% 


130 


RF-TS2 


126 


3 


VH3-13-13 


4 


4.1% 


82 


E55 3.12 


109 


3 


VH3-13-15 


0 


0,0% 


26 


19.E7 


126 


3 


VH3-13-14 


3 


3.1% 


129 


11-50 


119 


3 


VH3-13-13 


6 


6.1% 


130 


E29.1 


120 


3 


VH3-13-15 


2 


2,0% 


25 


E55 3.16 


108 


3 


VH3-13-7 


6 


6,1% 


26 


TNF-E1 


117 


3 


VH3-13-7 


7 


7,1% 


42 


RF-SJ1 


127 


3 


VH3-13-13 


6 


6,1% 


83 


F0G1-A4 


116 


3 


VH3-13-7 


8 


8.2% 


42 


TNF-A1 


117 


3 


VH3-13-15 


4 


4,1% 


42 


PR-SJ2 


107 


3 


VH3-13-14 


8 


8,2% 


85 


HN.14 


124 


3 


VH3-13-13 


10 


10.2% 


33 


CAM' 


121 


3 


VH3-13-7 


12 


12,2% 


65 


HIV-B8 


125 


3 


VH3-13-7 


9 


9,2% 


12 


HIV-b27 


125 


3 


VH3-13-7 


9 


9,2% 


12 


HIV-b8 


125 


3 


VH3-13-7 


9 


9,2% 


12 


HIV-S4 


125 


3 


VH3-13-7 


9 


9,2% 


12 


HIV-B26 


125 


3 


VH3-13-7 


9 


9,2% 


12 


HIV-B35 


125 


3 


VH3-13-7 


10 


10.2% 


12 


HIV-bl8 


125 


3 


VH3-13-7 


10 


10.2% 


12 


HIV-b22 


125 


3 


VH3-13-7 


11 


11.2% 


.12 


H!V-b13 


125 


3 


VH3-13-7 


12 


12.2% 


12 


333 


117 


3 


VH3-14-4 


24 


24,0% 


24 


1H1 


120 


3 


VH3-14-4 


24 


24.0% 


24 


1B11 


120 


3 


VH3-14-4 


23 


23,00/0 


24 


CLL30 2-3 


86 


3 


VH3-13-19 


1 


1,0% 


29 


6A 


110 


3 


VH3-13-7 


19 


19,4% 


36 


JeB 


99 


3 


VH3-13-14 


3 


3.1% 


7 


GAL 


110 


3 


VH3-13-19 


10 


10,2% 


126 


KSH6 


119 


3 


VH3-1X-6 


18 


18.0% 


60 


K4B8 


119 


3 


VH3-1X-6 


18 


18.0% 


60 


K5B8 


119 


3 


VH3-1X-6 


18 


18.0% 


60 



?7- 

SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2C: (continued) 



PCT/EP96/03647 



Name 


aa 2 


Computed 
family 3 


Germline 

A 

gene 


Diff. to % diff. to 
germline s germline* 


Reference 


K5C7 


119 


3 


VH3-1X-6 


19 


19,0% 


60 


K5G5 


119 


3 


VH3-1X-6 


19 


19.0% 


60 


KGF5 


119 


3 


VH3-1X-6 


19 


19.0% 


60 


AL3.16 


98 


3 


VH3-13-10 


1 


1,0% 


117 


N86P2 


98 


3 


VH3-13-10 


3 


3.1% 


77 


N54P6 


95 


3 


VH3-13-16 


7 


7,1% 


77 


LAMBDA HT112-1 


126 


4 


VH4-11-2 


0 


0,0% 


3 


HY18 


121 


4 


VH4-1 1-2 

VI IT It £, 


0 


0.0% 


43 — 


mAb63 


126 


4 


VH4-11-2 


0 


0.0% 


45 


FS-3 


105 


4 


VH4-1 1-2 

v i it I I L. 


0 


0,0% 


86 


FS-5 


1 1 1 


4 


VH4-1 1-2 

VI It II Z, 


0 


0,0% 


86 


FS-7 


107 


4 


VH4-1 1-2 


0 


0,0% 


86 
oo 


FS-8 


110 


4 


VH4-1 1-2 


0 


0,0% 




PR-TS2 


105 


4 


VH4-1 1-2 


0 


0.0% 


(J «J 


RF-TMC 


102 


4 


VH4-1 1-2 

V 1 IT 1 1 Z 


0 


0.0% 


8t 


mAb216 


122 


4 

T 


VH4-1 1-2 

v i it I t Z. 


1 


1.0% 


I D 


mAh410 7 FQ1 


122 


A 
t 


VH4-1 1-9 


1 


1,0% 




mAhAfiH40^ 


124 


A 
*t 


VHA-1 1-9 

VflT* 1 1 Z 


1 


1,0% 




Ab44 


127 


4 


VH4-1 1-9 

V 1 IT 1 1 Z 


2 


2.1% 


1 nn 


GH-3C4 


124 


4 
t 


VH4-1 1-2 

V lit 1 1 z 


3 


3.1% 




FS-6 


108 


4 


VH4-1 1-2 

VI IT 1 1 Z 


6 


6,2% 


oo 


FS-2 


114 


4 


VH4-1 1-2 


6 


6,2% 


o*+ 


HI61 


126 


4 


VH4-1 1-2 

V I IT II Z. 


7 


7,2% 


R9 


F5-4 


105 


4 


VH4-1 1-2 


8 


8.20/0 


86 


SA-4A 


123 


4 


VH4-11-2 

V 1 IT l| Z. 


9 


9.3% 


12^ 

1 ZJ 


LES-C 


119 


4 


VH4-11-2 


10 


10,3% 




DI 


78 


4 


VH4-11-9 


16 


16,5% 


^8 

JO 


Ab26 


126 


4 


VH4-31-4 


8 


8.1% 


100 

1 uu 


TS2 


124 


4 


VH4-31-12 


15 


15,2% 


1 in 


265-695 


115 


4 


VH4-11-7 


16 


16,5% 


5 


WAH 


129 


4 


VH4-31-13 


19 


19.2% 


93 


268-D 


122 


4 


VH4-11-8 


22 


22,7% 


6 


58P2 


118 


4 


VH4-11-8 


0 


0,0% 


104 


mAb67 


128 


4 


VH4-21-4 


1 


1,0% 


45 


4139 


115 


4 


VH4-1 1-8 


2 


2,1% 


108 


mF7 


in 


4 


VH4-31-13 


3 


3.0% 


75 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 
Table 2C: (continued) 



PCI7EP96/03647 



Name 1 


aa l 


Computed 
family 3 


Germline 

A 

gene 


Diff. to °A> diff. to 
germline 5 germline 6 


Kererenc 


33.C9 


122 


4 


VH4-21-5 


7 


7.1% 


129 


Pag-1 


124 


4 


VH4-11-16 


5 


5.2% 


50 


B3 


123 


4 


VH4-21-3 


8 


8.2% 


53 


IC4 


120 


4 


VH4-11-8 


6 


6.2% 


70 


C6B2 


127 


4 


VH4-31-12 


4 


4.0% 


48 


N78 


118 


4 


VH4-11-9 


11 


11.3% 


77 


B2 


109 


4 


VH4-11-8 


12 


12.4% 


53 


WRD2 


123 


4 


VH4-11-12 


6 


6.2% 


90 


mAb426 4.2F20 


126 


4 


VH4-11-8 


2 


2.1% 


52 


E54 4.58 


115 


4 


VH4-11-8 


1 


1.0% 


26 


WRD6 


123 


4 


VH4-11-12 


10 


10,3% 


90 


mAb426 12 3F1 4 


122 


4 


VH4-11-9 


4 


4.1% 


52 


F*U 4 2 


108 


4 


VH4-21-6 


2 


2.0% 


26 


WIL 


127 


4 


VH4-31-13 


0 


0.0% 


90 


COF 


126 


4 


VH4-31-13 


0 


0.0% 


90 


LAR 


122 


4 


VH4-31-13 


2 


2.00/0 


90 


WAT 


125 


4 


VH4-31-13 


4 


4.0% 


90 


mAbGl 


123 


4 


VH4-31-13 


5 


5.1% 


45 


WAG 


127 


4 


VH4-31-4 


0 


0.0% 


90 


RF-SJ4 


108 


4 


VH4-31-12 


2 


2.0% 


85 


E54 4.4 


110 


4 


VH4-11-7 


0 


0,0% 


26 


F 1 ^ 4 A1 


108 


4 


VH4-11-7 


0 


0.0% 


26 


r r\ j-j i 


103 


4 


VH4-11-7 


1 


1.0% 


85 


£^4 4 21 


1 1 1 


4 


VH4-11-7 


1 


1.0% 


26 


CM 7 7-2 


97 


4 


VH4-11-12 


0 


0,0% 


29 




95 


4 


VH4-11-12 


0 


0.0% 


104 


ALL52 30-2 


91 


4 


VH4-31-12 


4 


4.0% 


29 


EBV-21 


98 


5 


VH5-12-1 


0 


0.0% 


13 


CB-4 


98 


5 


VH5-12-1 


0 


0.0% 


13 


CLL-12 


98 


5 


VH5-12-1 


0 


0.0% 


13 


13-4 


QQ 


c 
O 


vn3 1 z. \ 


0 


0,0% 


13 


CLL11 


98 


5 


VH5-12-1 


0 


0,0% 


17 


C0RD3 


98 


5 


VH5-12-1 


0 


0.0% 


17 


C0RD4 


98 


5 


VH5-12-1 


0 


0.0% 


17 


C0RD8 


98 


5 


VH5-12-1 


0 


0,0% 


17 


C0RD9 


98 


5 


VH5-12-1 


0 


0,0% 


17 

















SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 
Table 2C: (continued) 



PCT/EP96/03647 



Name 1 aa 2 Computed Germline Diff. to % diff. to Reference 7 

family 3 gene 4 germline 5 germline 6 



CD+1 


98 


5 


CD+3 


98 


5 


CD+4 


98 


5 


CD-1 


98 


5 


CD-5 


98 


5 


VER614 


98 


5 


PBL1 


98 


5 


PBL10 


98 


5 


STRAb SA-1A 


127 


5 


DOB' 


122 


5 


VERG5 


98 


5 


PBL2 


98 


5 


Tu16 


119 


5 


PBL12 


98 


5 


CD+2 


98 


5 


CORD 10 


98 


5 


PBL9 


98 


5 


CORD2 


98 


5 


PBLG 


98 


5 


C0RD5 


98 


5 


CD-2 


98 


5 


C0RD1 


98 


5 


CD-3 


98 


5 


VERG4 


98 


5 


PBL13 


98 


5 


PBL7 


98 


5 


HAN 


119 


5 


VERG3 


98 


5 


PBL3 


98 


5 


VERG7 


98 


5 


PBL5 


94 


5 


CD-4 


98 


5 


CLL10 


98 


5 


PBL11 


98 


5 


C0RD6 


98 


5 


VERG2 


98 


5 



VH5-12-1 


0 


0,0% 


17 


VH5-12-1 


0 


0,0% 


17 


VH5-12-1 


0 


0,0% 


17 


VH5-12-1 


0 


0,0% 


17 


VH5-12-1 


0 


0,0% 


17 


VH5-12-1 


0 


0,0% 


17 


VH5-12-1 


0 


0,0% 


17 


VH5-12-1 


0 


0,0% 


17 


VH5-12-1 


0 


0,0% 


125 


VH5-12-1 


0 


0,0% 


97 


VH5-12-1 


0 


0,0% 


17 


VH5-12-1 




1,0% 


17 


VH5- 1-2-1 




~"~ 1.0% 


49 


VH5-12-1 


! 


1,0% 


17 


VH5-12-1 




1,0% 


17 


VH5-12-1 




1,0% 


17 


VH 5-12-1 


1 


1,0% 


17 


VH5-12-1 


2 


2,0% 


17 


VH5-12-1 


2 


2.0% 


17 


VH 5-12-1 


2 


2,0% 


17 


VH5-12-1 


2 


2,0% 


17 


VH5-12-1 


2 


2,0% 


17 


VH5-12-1 


3 


3,1% 


17 


VH5-12-1 


3 


3,1% 


17 


VH5-12-1 


3 


3,1% 


17 


VH5-12-1 


3 


3,1% 


17 


VH5-12-1 


3 


3,1% 


97 


VH5-12-1 


3 


3,1% 


17 


VH5-12-1 


3 


3,1% 


17 


VH5-12-1 


3 


3.1% 


17 


VH5-12-1 


0 


0,0% 


17 


VH5-12-1 


4 


4,1o/o 


17 


VH5-12-1 


4 


4,1% 


17 


VH5-12-1 


4 


4,1% 


17 


VH5-12-1 


.4 


4,1% 


17 


VH5-12-1 


5 


5,1% 


17 


©O 









SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 
Table 2C: (continued) 



PCT/EP96/03647 



Name 1 aa 2 Computed Germline Diff. to %diff. to Reference 7 

family 3 gene' 9 erm,ine5 9 ermiine6 



83P2 


119 


5 


VH5-12-1 


0 


0,0% 


103 


VERG9 


98 


5 


VH5-12-1 


6 


6,1% 


17 


ai6 


98 


5 


VH5-12-1 


6 


6,1% 


17 


PBL8 


98 


5 


VH5-12-1 


7 


7,1% 


17 


Ab2022 


120 


5 


VH5-12-1 


3 


3,1% 


100 


CAV 


127 


5 


VH5-12-4 


0 


0.0% 


97 


HOW 


120 


5 


VH5-12-4 




0,0% 


97 


PET 


127 


5 


VH5-12-4 


0 


0.0% 


97 


ANG 


121 


5 


VH5-12-4 


0 


0,0% 


97 


KER 


121 


5 


VH5-12-4 


0 


0.0% 


97 


5.M13 


118 


5 


VH5-12-4 


0 


0,0% 


107 


Au2.1 


118 


5 


VH5-12-4 


1 


1,0% 


49 


WS1 


126 


5 


VH5-12-1 


9 


9,2% 


110 


TDVn 


98 


5 


VH5-12-4. 


1 


1.0% 


16 


TEL13 


116 


5 


VH5-12-1 


9 


9.2% 


73 


E55 5.237 


112 


5 


VHS-12-4 


2 


2,0% 


26 


VERG1 


98 


5 


VH5-12-1 


10 


10,2% 


17 


CD4-74 


117 


5 


VH5-12-1 


10 


10,2% 


42 


257-D 


125 


5 


VH5-12-1 


11 


11,2% 


6 


CLL4 


98 


5 


VH5-12-1 


11 


11,2% 


17 


CLL8 


98 


5 


VH5-12-1 


11 


11,2% 


17 


Ab2 


124 


5 


VH5-12-1 


12 


12,2% 


120 


Vh383ex 


98 


5 


VH5-12-1 


12 


12,2% 


120 


CLL3 


98 


5 


VH5-12-2 


11 


11,2% 


17 


Au59.1 


122 


5 


VH5-12-1 


12 


12,2% 


49 


TEL16 


117 


5 


VH5-12-1 


12 


12,2% 


73 


M61 


104 


5 


VH5-12-1 


0 


0,0% 


103 


TuO 


99 


5 


VH5-12-1 


5 


5.1% 


49 


P2-51 


122 


5 


VH 5-12-1 


13 


13.3% 


121 


P2-54 


122 


5 


VH5-12-1 


11 


11.2% 


121 


P1-56 


119 


5 


VH5-12-1 


9 


9,2% 


121 


P2-53 


122 


5 


VH5-12-1 


10 


10.2% 


121 


Pl-51 


123 


5 


VH5-12-1 


19 


19,4% 


121 


P1-54 


123 


5 


VH5-12-1 


3 


3,1% 


121 


P3-69 


127 


5 


VH5-12-1 


4 


4.1% 


121 


P3-9 


119 


5 


VH5-12-1 


4 


4.1% 


121 



8 / 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2C: (continued) 



PCT/EP96/03647 



Name 1 


aa 2 


Computed 
family 3 


Germiine 
gene 4 


Diff. to % diff. to 
germiine 5 germiine 6 


Refereni 


1-185-37 


125 


5 


VH5-12-4 


0 


0,0% 


124 


1-1R7-79 


125 


5 


VH5-12-4 


0 


0.0% 


124 


r i jo 


128 


5 


VH5-12-4 


10 


10,2% 


121 


P9.c;7 


lift 




VH5-12-4 


3 


3,1% 


121 


P9 ex. 


17T 


oJ 


VH 5-12-1 


5 


5,1% 


121 


r Z -DO 


173 


*j 


VH 5-1 2-1 


20 


20,4% 


121 


P9-c;9 
r z Ot 


122 


5 


VH5-12-1 


11 


11,2% 


121 


P7-KO 
ro ou 


122 


j; 
%j 


VH5-12-1 


8 


8.2% 


121 


Pi £7 
r I -D/ 


I Z J 




V 1 i J 1 Z. 1 


4 


4.1% 


121 


Pi ^ 
r 1 -DD 


179 

1 ZZ 




V I 1 <J 1 Z 1 


14 


14,3% 


121 


Mm A 


1 9fl 
1 zo 


C 

0 


vnD* i z— *r 


12 


12,2% 




Pi C*> 

r 1-dz 


191 
1 Z 1 


c 
0 


V/Hc;-19 1 
VnD- 1 Z- 1 


11 


11.2% 


171 

1 Z I 


PI 1 c 
LLLd 




r 
D 


vno- l z- 1 


13 


13,3% 


1 7 
1 / 


PI 1 "7 

LLL/ 




c 
d 


\/L|C 19 i 
VnD- 1 z- 1 


14 


14,3% 


1 7 


i on n 
Lzr lU 


inn 


d 


vnD- 1 Z- 1 


1 


1.0% 


AC. 
*tO 


LoDO 


QR 




vnD l Z 1 


1 


1,0% 


46 


Vn0.n I Z 


1 1Q 


c 
o 


vno jj" 1 


13 


12,9% 


177 




107 




VH 6-3^-1 


1 


1,0% 


46 






c 

D 


VHfi-^-l 

V 1 IO OJ 1 


1 


1,0% 


4R 


<k7 

JJJ 


QQ 




VH 6-3^-1 


1 


1.0% 


46 


R-1fi1 


1(11 


c 

D 


vno jj" i 


0 


0,0% 


14 
i *t 






o 


vno jo i 


0 


0.0% 


GR 
DO 


1 1 C 


1 90 
I zu 


D 


VnD-JD- 1 


0 


0,0% 


oy 


(VI/ I 


191 
1 Z 1 


D 


Vno-JD" 1 


0 


0.0% 




MI 1 


1 9n 

1 ZU 


c 
o 


vno-oD- i 


0 


0.0% 




r i j i vi l » 


in7 


o 


VHfi-m-l 

VnD JD" 1 


0 


0.0% 


KR 
DO 


1 CLPl 


197 
1 z / 


c 

D 


vno Oj i 


0 


0.0% 




VH6 NM 

vii o.i v t 


121 


6 
\j 


VI IU JJ 1 


0 


0.0% 


1 77 


Mil 

vno. in i i 


1 91 


c 

D 


VnD JD* 1 


0 


0.0% 


1 99 

I ZZ 


VHG.N12 


123 


6 


VHG-35-1 


0 


0.0% 


122 


VH6.N2 


125 


6 


VHG-35-1 


0 


0.0% 


122 


VHS.N5 


125 


6 


VH6-35-1 


0 


0,0% 


122 


VH6.N6 


127 


6 


VHG-35-1 


0 


0,0% 


122 


VH6.N7 


126 


6 


VH6-35-1 


0 


0,0% 


122 


VH6.N8 


123 


6 


VH6-35-1 


0 


0.0% 


122 


VH6.N9 


123 


G 


VH6-35-1 


0 


0.0% 


122 



8.2. 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 
Table 2C: (continued) 



PCT/EP96/03647 



aa 2 Computed Germline Diff. to % diff. to Reference 7 
family 3 gene* 9ermline 5 germline 6 



VH6.N10 


123 


6 


VH 6-3 5-1 


0 


0,0% 


i 11 

1 ZZ 


VH6.A3 


123 


6 


VH6-35-1 


0 


0,0% 


Ml 
1 ZZ 


VH6-A1 


124 


6 


VH6-35-1 


0 


0,0% 


Ml 
1 ZZ 


VH6.A4 


120 


6 


VH 6-35-1 


0 


0,0% 


1 11 
1 ZZ 


E55 G.16 


116 


6 


VH6-35-1 


0 


0,0% 


1C 

zb 


E55 6.17 . 


120 


6 


VH6-35-1 


0 


0,0% 


1C 
ZD 


E55 B.G 


120 


6 


VH6-35-1 


0 


0,0% 


26 


VHGL 6.3 


102 


6 


VH6-35-1 


0 


0,0% 


26 


CB-201 


118 


6 


VH6-35-1 


0 


0,0% 


109 


VH6.N4 


122 


6 


VH6-35-1 


0 


0,0% 


122 


E54 6.4 


109 


6 


VH6-35-1 


1 


1,0% 


26 


VH6.A6 


126 


6 


VH 6-35-1 


1 


1,0% 


122 


E55 6.14 


120 


6 


VH6-35-1 


1 


1,0% 


26 


E54 6.6 


107 


6 


VH 6-35-1 


1 


1,0% 


26 


E55 6.10 


112 


6 


VH6-35-1 


1 


1,0% 


26 


E54 6.1 


107 


6 


VH6-35-1 


2 


2,0% 


26 


E55 6.13 


120 


6 


VH6-35-1 


2 


2,0% 


26 


E55 6.3 


120 


6 


VH6-35-1 


2 


2.0% 


26 


E55 6.7 


116 


6 


VH6-35-1 


2 


2,0% 


26 


E55 6.2 


120 


6 


VH6-35-1 


2 


2.0% 


26 


E55 6.X 


111 


6 


VH6-35-1 


2 


2,0% 


26 


E55 6.11 


111 


6 


VH6-35-1 


3 


3,0% 


26 


VH6.A1 1 


118 


6 


VH 6-35-1 


3 


3,0% 


122 


A10 


107 


6 


VH6-35-1 


3 


3,0% 


68 


E55 6.1 


120 


6 


VH6-35-1 


4 


4,0% 


26 


FK-001 


124 


6 


VH6-35-1 


4 


4,0% 


65 


VH6.A5 


121 


6 


VH6-35-1 


.4 


4,0% 


122 


VH6.A7 


123 


6 


VH6-35-1 


4 


4,0% 


122 


HBp2 


119 


6 


VH6-35-1 


4 


4,0% 


4 


Au46.2 


123 


6 


VH6-35-1 


5 


S.OO/o 


49 


A431 


106 


6 


VH6-35-1 


5 


5,0% 


68 


VH6.A2 


120 


6 


VH6-35-1 


5 


5,0% 


122 


VH6.A9 


125 


6 


VH6-35-1 


8 


7,9% 


122 


VH6.A8 


118 


6 


VH6-35-1 


10 


9,9% 


122 


VH6-FF3 


118 


6 


VH6-35-1 


2 


2.0% 


123 


VH6.A10 


126 


6 


VH6-35-1 


12 


11,9% 


122 



SL3 

SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 

Table 2C: (continued) 



PCT/EP96/03647 



Name 1 


aa 2 


Computed 
family 3 


Germline 
gene 4 


Diff. to % diff. to 
germline 5 germline 6 


Reference 7 


v/uc coin 
Vnb-tDlu 


117 


b 


\/UC 7C 1 

Vnb-oD- 1 


3 


3.0% 


i zj 


VH6-tb 


1 1 Q 
119 


6 


Vnb-35-l 


6 


5.9% 


1 97 
1 ZJ 


\/UC CC7 
Vnb-rtZ 


n 1 
IZ 1 


r 
O 


\/L|C *)C 1 

vnb-J5-l 


6 


5.9% 


1 


Vnb-ttb 


1 1 c 

lib 


b 


\/LIC 1C 1 

vnb-Jb- 1 


6 


5.9% 


1 97 
1 Z*> 


\/ur cnin 


1 1 Q 
I to 


b 


vno-Jo- 1 


6 


5.9% 


1 77 
1 Z J 


V no- LAo 


1 1 7 

1 1 J 


b 


\/UIC 7C 1 

vnb-Jb- 1 


6 


5,9% 


1 97 
I Zo 


Vnb-rby 


1/1 


b 


\ /LI C TC 1 

Vnb-35-1 


_ 8 


7.9% 


1 77 
1 ZJ 


\ /Ll /* Cr 


1 16 


6 


VH6-35-1 


9 


8,9% 


lzJ 


VH6-tL8 


122 


6 


VH 6-35-1 


9 


8,9% 


lz3 


VHb-t 10 


120 


6 


VH6-35-1 


10 


9.90/0 


123 


VrlD-rr 1 1 


I ZZ 


b 


\/UC TC 1 


11 


10.9% 


1 97 
1 Zo 


VH6-FD2 


115 


6 


VH6-35-1 


11 


10,9% 


123 


CLL10 17-2 


88 


6 


VH6-35-1 


4 


4.00/o 


29 


VH6-BB11 


94 


6 


VH6-35-1 


4 


4,Oo/o 


123 


VH6-B4I 


93 


6 


VH6-35-1 


7 


6.9% 


123 


JU17 


102 


6 


VH6-35-1 


3 


3.00/o 


114 


VH6-BD9 


96 


6 


VH6-35-1 


11 


10.90/o 


123 


VH6-BB9 


94 


6 


VH6-35-1 


12 


11.9% 


123 



s>4 
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Table 3A: assignment of rearranged V kappa sequences to their germline counterparts 



Family 1 


Name 


Rearranged 1 


Sum 




Vkl-1 


28 






Vkl-2 


0 




1 


Vkl-3 


1 






Vk!-4 


0 






Vkl-5 


7 






Vkl-6 


0 




1 


Vkl-7 


0 




, 


Vkl-8 


2 




1 


Vkl-9 


9 




, 


Vkl-10 


0 




1 


Vkl-H 


1 






Vkl-12 


7 






Vkl-13 


1 






Vkl-!4 


7 






Vkl-15 


2 




1 


Vkl-16 


2 




, 


Vkl-17 


16 






Vkl-I8 


1 






Vkl-19 


33 






Vkl-20 


1 




1 


Vk!-21 


1 




1 


Vkl-22 


0 




, 


Vkl-23 


0 


119 entries 


2 


Vk2-I 


0 




2 


Vk2-2 


1 




2 


Vk2-3 


0 




2 


Vk2-4 


0 




2 


Vk2-5 


0 




2 


Vk2-6 


16 




2 


Vk2-7 


0 




2 


Vk2-8 


0 




2 


Vk2-9 


1 




2 


Vk2-I0 


0 




2 


Vk2-ll 


7 




2 


Vk2-I2 


0 


25 entries 


3 


Vk3-I 


1 




3 


Vk3-2 


0 





8S 
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Family 1 


Name 


Rearranged 


Sum 


3 


Vk3-3 


35 




3 


Vk3-4 


115 




3 


Vk3-5 


0 




.3 


Vk3-6 


0 




3 


Vk3-7 


1 




3 


Vk3-8 


40 


192 entries 


4 


Vk4-1 


33 


33 entries 


5 


Vk5-1 


1 


1 entry 


6 


Vk6-1 


0 




6 


Vk6-2 


0 


0 entries 


7 


Vk7-1 


0 


0 entries 



§><<- 
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Table 3B: assignment of rearranged V lambda sequences to their germline counterparts 



Family' 


Name 


Rearranged 2 


Sum 


1 


DPL1 


1 




1 


DPL2 


14 




1 


DPL3 


6 




1 


DPL4 


1 






HUMLV117 


4 




1 


DPL5 


13 




1 


0PL6 


0 




1 


DPL7 


0 




1 


DPL8 


3 




1 


DPL9 


0 


42 entries 


2 


DPL10 


5 




2 


VLAMBDA 2.1 


0 




2 


0PL11 


23 




2 


DPL12 


15 




2 


DPL13 


0 




2 


DPL14 


0 


43 entries 


3 


DPLlb 


10 




3 


HDI *>*> 

UrLzJ 


1 Q 

i-y 






Humlv318 

1 lUilll V J IU 


9 


38 entries 


7 


DPL18 


1 




7 


DPL19 


0 


1 entries 


8 


DPL21 


2 




8 


HUMLV801 


6 


8 entries 


9 


DPL22 


0 


0 entries 


unassigned 


DPL24 


0 


0 entries 


10 


gVLX-4.4 


0 


0 entries 



8 r 
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Table 3C: assignment of rearranged V heavy chain sequences to their germiine counterparts 



Family' Name Rearranged' Sum 

VH1-12-1 38 

VH1-12-8 2 

VH1-12-2 2 

VH1-12-9 2 

VH1-12-3 0 

VH1-12-4 0 

VH1-12-5 3 

VH1-12-6 0 

VH1-12-7 23 

VH1-13-1 1 

VH1-13-2 1 

VH1-13-3 0 

VHl-13-4 0 

VH1-13-5 0 

VH1-13-6 17 

VH1-13-7 0 

VH1-13-8 3 

VH1-13-9 0 

VH1-13-10 0 

VH1-13-1 1 0 

VH1-13-12 10 

VH1-13-13 0 

VH1-13-14 0 

VH1-13-15 4 

VH1-13-16 2 

VH1-13-17 0 

VH1-13-18 1 

VH1-13-19 0 

VH1-1X-1 1 110 entries 

~2 VH2-21-1 0 

2 VH2-31-1 0 

2 VH2-31-2 1 

2 VH2-31-3 1 

2 VH2-31-4 0 

2 VH2-31-5 2 

2 VH2-31-6 0 

2 VH2-31-7 0 
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Family 1 


Name 


Rearranged 2 Sum 


2 


\ /U *) "11 1 A 

Vn/-J i - 1 *fr 


i 


i 
2 


\/ui li Q 




2 






1 

2 


VrlZ- J 1 - 1 U 


n 


2 


vnz-J i ■ i 


i 


i 

2 


Vni-J 1 - 1 1. 


n 

VI 


i 

2 


\/Ul 11 11 


1 7 pnfr/p** 


3 


\ /u i 1 i 1 

VH3-1 l-l 


U 


3 


\ /111 11 1 

VH3-1 I -2 


U 


3 


\ /LI ■) 1 1 O 

VH3-1 i -J 


r 

b 


3 


\ /LI Oil Vt 

VH3-1 1-4 


n 


3 


VH3-1 1-5 


i 


3 


VH3-1 1-6 


1 


3 


\ /I 1 1 11 1 

VH3-1 l -7 


0 


3 


VH3-1 1-8 


5 


3 


I/mi i *i 1 
VH3-13-1 


9 


3 


l /I 1 1 >t 1 i 

VH3-13-2 


*> 
J 


3 


VH3-13-3 


0 


3 


1 /I 1 1 11 A 

VH3-13-4 


0 


3 


VH3-13-5 


0 


3 


VH3-13-6 


0 


3 


VH3-13-7 


32 


3 


\ t\ 1 1 il o 

VH3-1 3-8 


4 


3 


\ /l J 1 1 1 o 

VH3-13-9 


U 


3 


VHJ-1 J-1U 




3 


\ /Li 1 11 11 

VH3-13-1 1 


U 


3 


\ /!_! 1 1 1 1 O 

VH3-1 S'M 


1 1 
I 1 


3 


VriJ- 1 J- 1 J 


i 7 


3 


\/U*> 1*3 \A 
VrU-1 J- 14 


o 
o 


3 


VH3-13-15 


4 


3 


VH3-13-16 


3 


3 


VH3-13-17 


2 


3 


VH3-13-18 


1 


3 


VH3-13-19 


13 


3 


VH3-13-20 


1 


3 


VH3-13-21 


1 


3 


VH3-13-22 


0 
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Table 3C: (continued) 



Family' Name Rearranged 2 Sum 



3 VH3-13-23 0 

3 VH3-13-24 4 

3 VH3-13-25 1 

3 VH3-13-26 6 

3 VH3-14-1 1 

3 VH3-14-4 15 

3 YH3-14-2 0 

3 VH3-14-3 0 

3 VH3-1X-1 0 

3 VH3-1X-2 0 

3 VH3-1X-3 6 

3 VH3-1X-4 0 

3 VH3-1X-5 0 

3 VH3-1X-6 11 

3 VH3-1X-7 0 

3 VH3-1X-8 1 

3 VH3-1X-9 0 212 entries 

4 VH4-11-1 0 
4 VH4-11-2 20 
4 VH4-11-3 0 
4 VH4-11-4 0 
4 VH4-11-5 0 
4 VH4-11-6 0 
4 VH4-11-7 5 
4 VH4-11-8 7 
4 VH4-11-9 3 
4 VH4-11-10 0 
4 VH4-11-11 0 
4 VH4-11-12 4 
4 VH4-11-13 0 
4 VH4-11-14 0 
4 VH4-11-15 0 
4 VH4-11-16 1 
4 VH4-21-1 0 
4 VH4-21-2 0 
4 VH4-21-3 1 
4 VH4-21-4 1 
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Family 1 


Name 


Rearranged 


Sum 


A 
*r 


VH4-21-5 


1 




A 

*T 


VH4-21-6 


1 




A 


VH4-21-7 


0 




4 




0 




A 
4 


vn*T ^ i j 


o 




A 
4 


v n*r i i 


o 




A 
4 


VH4-11-2 


o 




4 


v n*T _ o i j 


o 




4 


VH4-11-4 

V 1 1 " 1 ■ 


2 




A 

4 


VH4-11 


o 




4 


\/HA 11 -fi 
VriH-j 1 0 


o 




4 


\/HA 1 1 -7 


o 




4 


\/WA_7 1 -ft 


o 




4 




n 




A 

4 


VH4.-11-10 


o 




4 


VH4-31-1 1 


o 




4 


vriT'j l it 


4 




4 


VH4-31-13 


7 




/ 


VH4-31-14 


0 




A 
*r 


VH4-31-15 


0 




4 




0 




*r 


VH4-31-17 


0 




A 

4 


\/HA-11 -1ft 
vnH J i - i o 


o 




4 


VH4-31-19 


0 




4 


VH4-31-20 


0 


57 entries 


5 


VH5-12-1 


82 




5 


VH5-12-2 


1 




5 


VH5-12-3 


0 




5 


VH5-12-4 


14 


97 entries 


6 


VH6-35-1 


74 


74 entries 
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framework I 



amino acid 1 



«— CNI TO 



A 




1 














1 








102 




1 




B 






1 






l| 






















C 




























1 






D 


64 
































E 


8 




14 
























1 




F 


















1 


6 








1 






G 
































105 j 


H 




65 






























1 


























4 




K 






1 


























— i 


L 




6 




21 














96 




1 




, 


M 


1 






66 
























N 


































P 
















103 




1 




2- 






1 




Q 






62 






| 88 










1 












R 


































S 














89 




102 


80 




103' 




103 






T 




1 






1 88 










18 












V 




1 


i 9 
















8 




2 




98 



w 



X 1 

Y 



unknown (?) 



not sequenced 


31 


131! 18! 


18' 


17 


16 


I 16 


2 


1 
















sum of seq 7 
oomcaa 3 
mcaa 4 

rel. oomcaa 4 

pos occupied 6 


74 


! 74| 87 : 


87 


88 


89 


i 89 


103 


1 104 


! 105 


: 105 


105 


105 


105 


105 


105! 


- G4 


| 65! 62 


66 


88 


88 


j 89 


103 


j 102 


j 80 


! 96 


103 


102 


103 


98 


105! 


' D 


1 1 I Q 


M 


T 


Q 


! S 


P 


| S 


S 


I L 


S 


A 


S 


V 


G j 


i s> 

: CO 


i -e ' s 

! S|! 
l CO ' 

i eoi n 


1 2> 


. O 
O 


.0 

i §> 

: CT> 


: ^ 
: o" 
O 

i o 


! & 


: B" 
• CO 

1 en 


■ 3> 


■ ° 

cn 


o~ 
CO 

, cn 


.0 

3" 
cn 


CO 
CO 


■ o 


i 

O • 
O ; 
O j 


| 4! 5! 5 


! 2 


j 1 


I 2 


j 1 


! 1 


1 3 


4 


i 3 


2 


3 


3 


; 5 





SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 
Table 4A: Analysis of V kappa subgroup 1 



PCT/EP96/03647 



amino acidV ^ 2 S n c c ^ c 



A 






i| 


ii 




ii 






103| 














R 

D J 






















1| 










" C 1 














105! 


















n 


1 01 1 

1 \J 1 : 






























c 

L 


?! 

: 














1 


1| 




2! 










C 

r 






























ri 




















ij 












u 
n 






6! 










— i 






ii 








I 
i 






4! 


101 1 


i| 


















i 


i\ 
















2! 






ii 










L 







- 










ij 
















M 
M 










« ■ — 


















In 








... 














1 












p 

r 


























u 
















20 






100 










D 

n 
















81 














| i 






c; 




1 










102 










T 
1 




c 

O 




\ qq 




103 






| 1 


1 












V 






: qa 




2 




















I — j 


w 






























X 


1 

I 






























Y 


1 






















































j 105 


! 105 


I 105 


! 105! 


UIIMIUWII \Z ) 
































not sequenced 


























sum of seq 3 


,.. 
! 105 


! 10E 


»| 105 


j 105 


1 105 


1 105 


! 105 


j 105 


I 105 


I 105 


! 105 


I 105 


j 105 


1 105 


jjosj 
! iosj 


oomcaa 3 
mcaa 4 

reL oomcaa* 

pos occupied 1 


! 101 


; 9* 


\\ 98 


I 99 


| 101 


! 103 


| 105 


j 81 


! 103 


1 102 


! 100 


j 105 


j 105 


j 105 


! D 


I R 


! v 


| T 


| I 


\ T 


I c 


j R 

: O 

: r-» 

: 


! A 


| S 


I Q 










: & 
: o" 

to 


i £ 


1 m 


i o" 


i 

: O 
: CD 
: cn 


; 

: O 
CO 

: cn 


: 

: O 
: O 

\ o 


I .O 
: o 
: CO 

cn 


: _o 
: CT 

: r-~ 
: <D 


i o 
: O 
: LT) 


1 100% 


1 100% 


\ 

: O 
: O 

1 o 


o 

: o 
: O 
1 O 


| I 




3! : 


i\ l 




si : 


M 1 


! 5 








>! 1 


1 1 


\ 1 


j 1 
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CDRI 



CO 

rsi 



CSI 



o 
ro 



co ro 



ro 
ro 



to 
ro 



to 
ro 



oo 
ro 



CD 
CO 



A 










1j 


i! 




l! 


42! 














B 
























l! 


1! 






. c | 














1! 


















D 






25i 




1 


si 


7j 










1! 








E 9 












l| 










2! 








F 1 








ii 


1 




7| 








6| 












G 






25! 




7 


3l 
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1 


2\ 
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I 
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1 
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\ 


K S 










7\ 




101 
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2 


\ l| 
















M 












! 42! 
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N 
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50 














P 






























102| 


Q 
























98 


103 


2 




R I 








16 


! 3 
| 32 


2 












3 


ij 


S 






41 


2 




3 


1 


1 












T 






7 






1 4 






4 










! 1 




V 






1 


4 


| 1 




21 


1 
















W 


















j 104 












X I 




























Y 1 








j 1 




! 60 








! 98 












105 


: 1 nc 
• \\JD 




























unknown (?) 




























! 3 


j 1 


not sequenced 












1 1 


I 1 


I 1 






\ 1 


; 1 




j 1 


sum of seq J 


1 105 


t 105 


j 105 


j 105 


I 10E 


j| 104| 104 


I 104 


1 104 


I 104 


I 104 


! 104 


j 104 


! 104 


I 104 


oomcaa 5 
mcaa* 

rel. oomcaa' 
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1 105 


! 105 


1 41 


1 98 


! 5> 


'1 42i 60 
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\ 50 
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I 98 


! 103 
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unknown (?) 
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A 8 


• 7fl 


: 
















j 1 








! 5 












B | 








































C 1 








































D I 














\ 1 


























E 1 














| 69 


























F 


























I 2 










1 3 


j 39 




G 






j 1 


i 68 




j 69 






| 1 


| 


\ 69 


j 39 






i i 










| 68j 


H 






1 




































1 


























! 65 


| 38 








1 34 






K 










































L 








1 






168 






j 1 




| 1 












! 2 


! ■ 4 




M 




















I 67 








I 2 








1 4 






N 




























4 








3 


! 22 




P 






68 








1 
















1 44 












Q 


69 








69 


























1 


1 


1| 


R 


1 






1 




1 












4 












1 






S 










1 








1 


I 1 








22 










1 


l! 


T 
















— 











1 


2 


4 






1 


3 




V 
















| 1 






2 


2" 


16 






1 




, , • 


W 














1! 




67 






26! 


















X 






! 

i 




























i 

: 








Y 




1 














1 


















20! 






Z 










































































70! 


70! 








unknown (?) S 








































not sequenced! j 








































sum of seq' j 
oomcaa 1 j 


70| 


70| 


701 


70! 


70| 


70! 


70| 


70f 


70 
67 


\ 70l 


70l 


70! 


70j 


70! 


70! 


70 : ! 


70j 


70! 


70! 


70; 


69! 


70! 


681 


68! 


69| 


69! 


68! 


69| 


I 67| 


69! 


39! 


65| 


38 j 


44! 


70! 


70j 


34! 


39! 


68i 


mcaa' j 
rel. oomcaa 5 ! 


q! 


A | 


P j 


g j 


Q| 


G | 


L j 


E | 


W 


Ml 


G j 


6 j 


1 j 


1 j 


p ! 






1 j 


F | 


G \ 


>p i 

O* : 
CD : 
CD • 


o j 
O i 
O [ 


O" • 
r-N : 
CD : 


o* i 

r~ : 

CD : 


.p : 
o • 
CO • 
CD ■ 


,0 i 
O" : 
cn : 

CT> I 


! 

o" ■ 
r-* : 

CD : 


| 

©~ : 
CO 1 
CO ■ 


# 

CD 
CO 


I 

o" ; 
CO : 
CO : 


CT> : 
CD • 


-5 1 

o" : 
ID ■ 
IT> : 


O : 
O* : 

n : 
Ol • 


#1 
t : 
in ; 


#! 

CO : 
CD ; 


100% | 


C3 : 
O" : 

o • 
O j 


_o ! 
o ; 
m ; 


^p : 
O* : 

CD ; 

LO : 


#j 

CO • 


pos occupied' 1 i 


2! 


i! 


3| 


31 


2\ 


2\ 


3\ 


2j 


4 


4| 


2j 


4| 


4| 


6! 


5! 


ij 


i! 


to) 


6| 


3j 
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A 


1| 


34j 






69j 






















43! 










B 










































C 










































D 




i ij 














2| 














70 








E 




| : 












i! 


















33! 






F 




: 1 


1; 








48| 








3: 




4 














G 


l| 










3i 






67! 






















H 




! ij 




































1 


4| 






















1| 


44 








1 






K 


i; 




2j 


1 






47 = 




i; 




i 














8 


— 


1 


L 


1 


1 












22? 








2| 




1 




3; 








M 




























21 














N 


9| 




59! 








18 




























P 


1 


7 






































Q 


1 


1 








70 






64 : 
























R 


2 












2 




i 




69 














1 






s • 




1 


2 




1 




















i 5 








70 




T 


34 


26 


4 












3 








66 




j 65 


24 




27 




67! 


V 




















1 






3 














■ 3| 


W 








































■ — ; 


X 








































Y 






1 


68 


































Z I 








































unknown (?) 


















































































not sequenced 










































sum of seq' 


j 70 


i ? o 


I 70 


j 70 


! ? o 


1 70 


1 70 


1 70 


1 70 


| 70 


j 70 


! 70 


I 70 


\ 70 


1 70 


\ 70 


! 70 


1 70 


\ 70 


j 70 


oomcaa' 
mcaa' 

rel. oomcaa* 

pos occupied 6 


1 34 


\ 34 


! 59 


j 68 


| 69 


1 70 


j 47 


j 48 


j 64 


! G7 


! 69 


! 65 


j 66 


| 44 


\ 65 


| 43 


1 70 


j 33 


[70 

j s 


[_67 

f F 


j T 


j A 


j N 


1 Y 


1 A 


! Q 


! K 


i F 


j Q 


j G 


! R 


I v 


j T 


j 1 


\ T 


i a 


\ D 


j E 


: 

: O" 
■ CD 
: ^J" 


: O" 
: CD 

: 


i o 
: O* 
: 

• CO 


: O* 

• cn 


': .p 
: O" 
; CD 
■ CD 


I p 
: O* 

■ o 

! ° 


■ o 

: o 

• i*» 

; CD 


■ o 

; o> 

■ to 


j o 
: CD 


• 

\ & 
: to 

• CD 


: o 

: o~ 

; CD 

= CD 


: 

: o 
• CO 

: cn 


• 

: O* 

: 

: CD 


: c> 

: 3" 

• n 
; co 


: 

: o* 
: CO 
■ CD 


: O* 
! (D 


: o 

: 3* 

• o 

: O 


! C3 

: O 

• t 


\ 

: S* 
: O 
1 ° 


; _o 

• & 
: tD 
■ CD 


• — . — 


! 6 

-i 


i 7 


! 3 

: 


Li 


j 1 


J 4 


i 2 


! 5 


! 3 


j 2 


1 3 




j 3 


! 4 


[2 


[3 


j 1 


j _5 


| 1 


Li 
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amino acid 1 ^^S5§SS< CQ ^ 



A 






! 64 






1 










3 






i 1 


1 70 








B 






































• C 




































\ \ 7 0 


D 












! 2 












26 


1 70 












E 












! 64 


j 










44 














F 








i 
























i i 


! 1 


! 2\ . 


G 
















1 






h 
















H 








I 1 






! 1 


























! 1 










3! 1 


1 
















j 2 






K 




















3 
















■ — f i 


L 










3 




63 1 




70 














1 2 






M 
N 










! 67 




















\ 1 




j 1 






4 














1 


16 






















P 








































Q 








1 




3 




























R 


3 














23 


1 




62 


















S 


62 




1 










41 


49 






67 






1 










T 


.1 


69 


2 










3 


2 




4 








67 










V 






3 








4 








1 












64 






W 








































X I 






































Y j 






68 




























69 


68! | 


z 1 














































































unknown (?) 








































not sequenced 
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70- 


70 


70| 


7o; 


70 


70- 


70 


70! 


70j 


70j 


70l 


701 


70! 


70. 


70| 


70| 


70| 
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70| 70! 
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rel. oomcaa 5 
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62. 


69 


641 


68 ! 


67 


64- 


63 


41j 


49! 


70l 


62! 


67l 


44 ! 


70| 


67; 


70| 


64! 


69! 


68| 70| 


s : 

#! 
co ; 


T i 

en i 
en • 


A| 


Y j 


M 


E : 


L 


S ; 


5 | 


L j 


R 1 


S | 


E | 


d i 


T j 


a| 


V j 


Y j 


Y ! C j 




i 

CD : 


# 
co i 
en • 


_q : 

O* : 
CH \ 


# 

o 
en 


# 

cn • 
in • 


1 


100% | 


# 

CD j 
OO : 


o~ i 
CD : 
CD ; 


#j 

ro • 

CO ! 


|100% j 


-@ = 
o" • 

co ; 
en • 


100% ! 


#! 

en i 


CD 


-O : o ; 

; o • 

; o : 

CD : «- : 


4 


_2 


4 


3 


z 


4 


3 


6 


6! 


1j 


4 


2] 


2! 


ll 


4| 


i! 


5| 


2\ 


2] ij 
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CDR III 



amino acid' SS£££g£o<cDOo^u.cDx--**:S 



A 


j — 
66 


j 2 


i i6 




{ 1 


j 1 


i 1 


I 4 


j 1 


I 2 


! 2 


! 1 


1 1 






I 1 


j 1 


j 2 






B 










































C 










I 1 


j 1 


{ 16 


2 




j 1 


j 1 


! 7 


2 


j 1 














D 








j 5 


I 3 




! 3 


5 


1 4 


I 3 


! 4 






j 1 


1 


I 14 








! 59I 


E 






! 9 








2 






i 1 






1 






| 1 










F . 










! 1 


! 3 




! 2 




3 


I 1 


! 2 




! 2 


| 1 








I 28 


! 2! 


G 




! 2 


j 14 


\ 13 


j 20 


! io 


j 14 


! 5 


! 20 


! 15 


! 16 


I 3 


1 3 


! 4 


! 15 


| 1 


! 1 


! 7 






H 




















j 1 


! 1 


i 1 




1 














1 








| 2 


5 


1 2 


; 2 




2 


! 2 


I 1 


1 






I 1 












K 




! 5 






2 


i 1 






1 
























L 




' 1 


4 


4 


2 


i 5 


2 


! 1 


1 




! 4 


! 2 




1 






1 




1 




M 






1 




2 




1 




1 






1 


1 












10 




N 








2 


2 


1 


2 


1 


2 


2 


2 


2 






1 


1 


4 








P 








20 


3 




1 


3 


2 


2 


2 


4 


2 


1 


4 


1 




1 




1! 


Q 








1 






1 




1 


1 


1 




















R 




55 


1 


5 


7 


8' 


1 


4 




2 




1 




16 














S 




1. 


1 


5 


5 


5'! 


5 


21 


5 


11. 


8 


4 


3 




2 


r 




2 




1! 


T 


1 


3: 


3 


5. 


4 


l! 


3 


4 


2= 


5 


2 




1 






ii 


1 








V 


JL 




3' 


2\ 


4 


3! 


3 


3 


4 


2; 


2; 


2 


1 


2 


1 












W 








l| 


1 


3| 


1. 


1! 






2| 




3: 








1! 


5! 


ii 




X 
























: 


















Y 




ij 




2] 


3| 


20l 


5! 




9! 


1! 


2! 


111 


20| 


10| 


6j 


9! 


10! 


7| 


1! 




Z 
















































ii 


2| 


2: 


3! 


6| 




11! 


141 


23! 


26| 


261 


3lj 


34I 


46! 


39! 


21! 


1! 


unknown (?) 8 
























1! 




1! 


i| 




2! 


3! 




not sequenced! 




2\ 


2j 


2! 


4! 


4| 


4j 


4! 


5! 


5! 


5! 


5! 


5! 


5j 


sj 


5! 


5| 


5! 


5! 


sum of seq' ] 
oomcaa 1 
mcaa 4 

rel. oomcaa 5 i 

pos occupied 6 j 


70! 


701 


68i 


68! 


68! 


66! 


66! 


66| 


66! 


65! 


65! 


65! 


65! 


65! 


65! 


65| 


65! 


651 


65! 


65j 


66! 


55! 


16! 


20| 


20! 


20| 


I6j 


21 1 


20| 


i s| 


16! 


23! 


26! 


261 


3 1 1 


34| 


46| 


39! 


28| 


59! 


a i 

O : 
: 

CD : 


R 1 


A ! 


P | 


6 j 


Y { 


C 1 


S j 


G i 




















F j 


D i 


i 

O" • 
CD • 


O" : 
^ : 
CM ■ 


O : 
O" ■ 
CT> : 

oi :■ 


.0 : 
O* • 
CT> ■ 
CM : 


,0 ■ 
O" : 
O : 
0 • 


| 

0~ ; 

: 

CM : 


i 

O* : 
CM : 
CO * 


,p ; 

O" : 

O! 
co : 


O" : 
C~> : 
CN = 


„0 : 
0~ : 

in : 

CM • 


O : 
0* • 
Lf> : 
CO : 


O ; 

O = 

: 


^p j 
0" : 
O: 
'd- : 


^p • 
O : 
CO ; 
• 


^p : 
O" : 
CM : 

tn : 


^0 • 


O : 
O : 
Oi 

cd ; 


0 : 
0 : 
n : 


.O ': 
O" : 

cr> i 


3| 


8| 


10! 


14; 


18! 


■15! 


18! 


151 


15! 


17| 


17| 


15! 


12! 


H; 


1l{ 


10! 


81 


7| 


6! 


61 
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ammo acid 1 2 2 9 2 



to n co o 
o o o o «- 



«— csi m 



A 


























B 


























C 


























D 




i 1 


I 1 




















E 


1 


I 1 






















F 


2 
























G 






1 58 




59 


1 


! 1 












H 

1 

K 

L 








1 


















3 
















4 














3 




! 1 














3 






1 






| 40 


i i 










M 


1 












! 3 












N 








1 


















P 

Q 


5 






















1 








52 


















R 








1 


















S 

T 






















53 


51 












54 


11 


i 


51 ! 




1 




V 


15 




1 








1 


54 




54 




1 


W 




59 




1 


















X 

Y 

Z 


























34" 




l! 


















— 


























l! 
























unknown (?) 


























not sequenced 


5j 


9| 


9| 


ioI 


111 


14! 


14! 


14! 


15! 


16! 


16; 


17 


sum of seq' | 


65! 


SI | 


611 


60! 


59j 


56j 


56| 


56! 


55! 


54! 


54| 


53| 


oomcaa 5 
mcaa* 

rel. oomcaa 5 ! 

pos occupied 5 j 


34| 


59! 


58! 


52! 


59! 


54! 


40j 


54! 


5l| 


54| 


.53; 


51 ! 


Y j 


W| 


G ; 


Q| 


G [ 


T | 


L I 


V | 


T | 


v ! 


S | 


S j 


-2 ■ 
o ■ 

CM • 

in • 


#j 

cn ; 


i 

O" : 
LT> : 
CD : 


: 

O" : 

: 

CO : 


hoo% | 


: 

o* • 
LO : 

o> ■ 


! 


o" : 
CO ; 
CD : 


o I 

o" ; 
n • 

CD : 


100% 


-5 : 

• 

CO • 
CO • 


^p i 

o* - 
CD ■ 

a> ; 


9i 


3! 


4| 


7\ 


1 j 


3| 


5; 


3! 


2! 


ij 


2! 


3| 



sum 
670 

165 
308 
297 
226 
928 
14 
286 
325 
386 
189 
176 
238 
494 
351 
972 
736 
699 
243 

542 

3 

578 
8 

406 



14 & 
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amino acid' — 
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A 


















32 














■ 34 










B 










































C 










































D 










































E 




1 






5 


1 
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F 










































G 
















27 














35 












H 






1 






















1 














I 








































1 


K 




3 


1 


















34 


33 












33 




L 






3 


26 


1 
































M 








1 


1 
































N 










































P 
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1 








Q 


21 
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2G 






























R 
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1 


2 
















S 














27 


















1 


34 








T 
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1 










2 




V 


3 
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20 












35 
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Z 




















































































unknown (?) 










































not sequenced 


15 


15 


15 


13 


13 


■ 13 


■ 13 


; 13 


6 


5 


5 


5 


5 


5 


5 


5 


5 


5 


5 


5 


sum of seq 2 
oomcaa 3 
mcaa 4 

rel. oomcaa 5 
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25 


25 


25 


27 


27 


: 27 


i 27 


' 27 


34 


35 


35 


35 


35 


35 


35 


35 


35 


35 


35 


35 


21 


21 


20 


26 


20 


; 26 


1 27 


; 27 


32 


35 


35 


34 


33 


33 


35 


34 


34 


35 


33 


34; 


Q 


V 


Q 


L 


V 


| Q 


1 S 


1 G 


A 


E 


V 


K 


K 


P 


G 


A 


S 


V 


K 


v - 
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o 

^* 

00 

3 


# 
o 

CO 

4 


o" 
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■ 
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; 

4 


i CO 
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; .O 
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! 
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| 1 
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• 
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.o 
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O 

1 


_© 

o* 
O 
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o> 


,p 

CD 
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cd 


.p 
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O 
O 


_p 
O* 
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cd 


_o 
o 
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CD 
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©* 
O 
O 


.p 
CT> 


.© 

o 
CD 


2 


2 


3 


1 


2 


2 
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1 


2 
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SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 
Table 6B: Analysis of V heavy chain subgroup 1B 



PCT7EP96/03647 



CDRI 



amino 3CI0 fMMrMMwcNNNNno^^nnnnnnn 



m co co 



A 








1 30 




: : 
: 1 








1 2 








I 6 












B 








































' C 




! 35 








: j 




























D 












: : 
I : 






: 


j 1 








! 5 




1 








E 






1 3 














1 1 




















. F 












I ! 2 




39 










1 2 


i 2 












G 








1 




\ 40 








1 


14 








1 












H 




























! 3 


i 1 




34 








1 
















i 1 




: 1 












I 9 










K 






28 




































L 


















! i 




1 










: 5 






; 2 




M. 
































■ 23 










N 














1 






1 


3 










1 


3 








P 






























i 












Q 






2 
















1 








1 




1 








R 






2 










2 












1 












37! 


S 


35 








40 






5 




2 


15 






2 


1 












T 
V 








3 








32 




34 










1 


















1 






1 






1 


1 








2 


2 






38 




W 




































40 






X 










































Y 














36 








1 






32 


19 




1 








Z 


































































40. 


•40 
















unknown (?) 










































not sequenced 


5 


5! 


5 


5. 


































sum of seq' 
oomcaa' 
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35 
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40 
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40 


40 : 
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40| 
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40 
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s" 
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28 


30= 


40 : 


40 
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32! 
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23! 
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a! 
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#! 
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CO : 


#1 
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CO : 


00 ; 

O : 


1000/0 j 


-9 i 

o" ■ 

0 ■ 
0 1 


-9 * 
o~ : 

O: 

09 ■ 


# 

CD • 

• 


-9 I 
O" • 

co ; 
in ; 


#1 

IT) • 
CO • 


#! 

O: 

0 ; 
l| 


: 

o" • 

LD : 

_.9?_i 
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1 


1 | 


4 


4 


1 


1 


M 


4| 


2\ 


61 
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5| 
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'5o 

SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 
Table 6B: Analysis of V heavy chain subgroup 1B 



PCT/EP96/03647 



Framework II 



amino acid 1 ^^^^^^^^^^^^^^^ ^^^ 
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39! 
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ij 








7! 






1 j 






B 










































C 
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, : 












l i 




E 
















39| 




















l| 


i 




F 
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1 










l! 






G 








39 




28| 










39! 


ij 






1 






9| 


1: 


39! 


H 








I 




























2 




— 1 


1 


























34 














K 










1: 






























• 


L 






1 








37 ; 












1 
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37! 




2 


4 
















N 




























35 








20! 


12 


1 \ 


P 




1 


34' 








1 
















31 












Q 


39 








39 






1 


























R 


1 
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: 3! 


i 




S 
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1 
















: 2 








1 


20 


— i 
— -i 


T 






! 4 
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3 
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1 


1 












W 


















40 






33 
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s — 














































I 40 


; 40 
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not sequenced! 








































sum of seq' 


I 40 


j 40 


1 40 


! 40 


|40 


1 40 


| 40 


j 40 


| 40 


j 40 


1 40 


j 40 


Uo 


j 40 


! 40 


j 40 


\ 40 


j 40 


; 40 


j 40 1 


oomcaa' 
mcaa' 

rel. oomcaa 5 
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j 39 


|39 


1 34 


! 39 


! 39 


1 28 


\ 37 


! 39 


| 40 


! 37 


! 39 


! 33 


| 34 


j 35 


1 31 


j 40 


! 40 
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| 20 

[T 
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j G 


i Q 
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i p 
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1 Q 
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A 


l| 2 






27 


2 




1 


1 




1 








2 








12l 


B 


1 

: 
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: 




































D 


i! 












i 

-"•—•4" 


4 














; 35 








E 
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1 




1 


1 












1 










F 






4 








39 












3 














G 


151 


6 




1 










34 






















H 




1 


1 


























1 






221 


i 




1 


















1 


1 


13 










K 


2! 2. 
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li 
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11 
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T 




35 
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38 
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X 










































Y 
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CO 
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"n 
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A* 
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10 


36 

If 

# 
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01 
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5 
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37 
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in 
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in 
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D 
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E 
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Z: 
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1 
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1 i 
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39 
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N 
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1 


2j 
























P 
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Q 








































R 


4 














2 
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37 




















S 
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1 








35 


20 




: 1 


36 












1 


1 




T 


1 


39 












1 






1 








40 
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1 
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33 
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38 


35 




Z 
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I 40 
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ij 


6| 




1! 




2i 


3| 


1! 


3| 




l| 
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sj 
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• C 
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3 








2| 
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D 






7 




5l 


2 


3 


l! 


5| 


4| 




Ij 




2! 


2| 


1; 








27 ! 


E 






2j 
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i! 


l| 




2! 




l! 




l| 












F 








1 


l! 


3 




3 

: 


2| 


ij 


1! 


1! 


l| 










2| 


is! 




G 




1 


7 


7 


Jy 


5 


9 


4; 


7| 


i! 


3! 




2 


2 


l! 




i! 




«i 


ij 


H 
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2| 






ij 


1! 














1 
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1! 


l! 


3| 


l| 


1 


l| 


1 


1: 
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2 
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4 


4 
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r 
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1! 


1 
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1: 
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6 


4 








1 


1 




3 


2 
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Q 
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1 


2 


1 














R 


1 


31 




5 


1 


1 


3 










1 




1 








1 






S 




1 


3 


3 


1 


4 


3 


6 


3 


2 


2 


1 
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T 




: 2 


1 


1 


2 


i 2 


1 


5 


1 


1 1 
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V 
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7 


1 
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1 


3 


! 1 
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2 
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1; 
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1 
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■ 1 




4 
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Y 








5 


! 5 


! 4 


2 


3 




4 


j 3 


i 3 


= 2 


i 1 


i 2 


!' 5 


! 6 


i 2 






Z 
















































i 


j 1 


1 1 


! 4 


! 6 


\ 8 


! 10 


! 11 


! 14 


! 20 


! 23 


! 25 


I 25 


j 25 


! 23 


i 18 


! 11 


i 6 
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I 3 




1 


I l 


1 3 


1 3 


i 3 


i 3 


i 3 


! 3 




! 4 


I 4 


! 4 


i 4 


! 4 


! 4 


i 4 


1 4 


! 4 
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i 4 


sum of seq' 


| 39 


• 39 


! 37 


! 37 


i 37 


! 37 


j 37 


1 37 


j 36 
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I 36 


! 36 


j 36 


! 36 


I 36 


| 36 


1 36 


! 36 


! 36 


j 36 


oomcaa 1 
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rel. oomcaa 5 
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31 
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i 8 
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j 23 
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R 
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. cm n ^ LO CD 

amino acid 1 2 2 2 2 2 



oo 

O O 



<y> o — cn n 
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D 
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i! 
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27 
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7! 
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N 
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P 
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R 
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S 
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18 


18 


T 












21 


6 




16 




1 




V 


6 














21 




18 






W 




; 29 






















X 


























Y 


11 








— 
















Z 




























3 
























unknown (?) 


























not sequenced 


4 


! 11 


j 13 


S 13 


\ 14 


! 19 


! 19 


: 19 


j 20 


! 20 


| 21 


I 22 


sum of seq 1 


! 36 


j 29 


I 27 


I 27 


I 26 


| 21 


1 21 


i 21 


I 20 


! 20 


! 19 


1 1 81 


oomcaa 1 
mcaa' 

rel. oomcaa* 

pos occupied 6 


\ n 


1 29 


| 27 


| 23 


\ 26 


j 21 


I 12 


i 21 
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| 18 
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| W 
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j Q 
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| T 
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1 
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s| 
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\ F 


| S 


rel. oomcaa 5 


£ 

IE 


; o~ ■ a* 

; CO | CD 
: CD : CT 


i 

: o* 
: CM 

; cn 


! * 

CD 

: CD 


i # 

: cn 
cn 


! il #1 # 

O ■ CO : CN 
* <— : CO : CD 


100% \ 


i # 

CO 
: CD 


# 

IT) 
• CD 


i «0 
: o~ 
CO 


! # 

! CD 


i „o 

i O 
: CO 
CO 


pos occupied' 


! i 


\\ 3\ ' 


x\ : 


j! : 




iS l! 7\ E 


>l 1 


I : 


l! 1 




i! L 


\\ 1 
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CDRI 



Frame* 



amino acid 1 S < 00 



CN/ro^fLOCD co cd O f - * 



A | 1 






1 7 1 


80: 




ij 






1 




187 




1 




B 
































C 
























1 




1 




0 


26 






3i 


7\ 




2| 


















E 


1 








10! 


















1 


1; 


F . 








5 
























G 


13 








31! 




1 










2 




209 




H 






4 






88] 


















1 


1 






l| 




15 






12 














K 


7 




















1 








202 j 


L 


3 










3 






2 


3 


1 


2 


1 






M 












193 




















N 


35 






8' 


3 




34! 


















P 








1 






1 










4 


191 






Q 


7 




















209 




1 




1 


R 


















207 




7 






8j 


S 


103 






: 17 


8 




72 










3 


14 






T 


9 








15 




10 










4 


5 






V 


2 








7 


1 






197 






2 








w 










30 






212 
















X 


1 






























Y 


1 






| 154 


19 




3 


















Z 




































! 210 


210 


























unknown (?) 
































not sequenced 


u 






! 2 


2 








1 


1 


1 










sum of seq' 
oomcaa 1 
meaa' 

rel. oomcaa* 

pos occupied 11 


| 210 


1 210 


210 


j 210 


210 


. 212 


212 


212 


211 


211 


211 


212 


212 


212 


' 212 


1 103 


| 210 


1 210 


\ 154 


: 80 


; 193 


88 


212 


197 


. 207 


209 


187 


191 


209 


= 202 


j S 






I Y 


! A 


i M 


• H 


W 


V 


' R 


Q 


A 


P 


G 


: K 


: O 
: CJ1 

• ->s- 


100% 


: o* 

o 

O 


„p 
: o* 

: 

: r^. 


■ o 
: O 
00 

n 


: CD 


: cT 
CM 


100% 


n 

CD 


o" 
CO 
CT> 


,© 

: o* 
C7) 


.p 
O* 
CO 
CO 


,p 

o* 
O 

CD 


.0 

cn 

CD 


: 

: O* 
CD 


! 14 


; 1 


| 1 


[ 9 


! 10 


; 4 


9 


1 


j 3 


j 3 


L 3 


9 


5 


4 


! 4 
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work II 



amino acid' 


■*• 




CD 




CO 
•tf 


cn 


0 

LO 


in 


CN 
tf) 


< 


CD 




CO 
LO 


LO 


LO 
Lf) 


A 


1! 








1 77| 


42 1 




1 j 


2| 




14I 




7! 1 


B 






O : 












l| 










C J ! 






















ij 




D I 1 




1 : 
1 : 












l\ 






94! 




3| 


E I | 














3j 


2\ 


ll 






i ii 


F 1 










7! 




2| 


ll 








ii 


8i 


G 


207! 








! 33! 


1 ll 




10| 


46 






M 


163| 


85f 


H 












D: 






l! 

!* 










1 










3! j 


-J ', 


191 < 




: 

1i 










1; 


K 














1 1 
1 : 


O / : 


7' 

c. : 


30 




3! 


1! 




L 




21 1 






s\ 


1 9": 
IZ: 


1 : 
















M 

™ 


1 










1 : 
1 


1 : 

1 : 














N 


--I 














7 
/ 


Q 


2 




13 


11; 


l! 


P 




1 














1 






1 






Q 






7 






7 






10 












R 


1 












1 
I 


17 


5 


1 




2 


j 1 G| 


s 


3 






: 1 
: 1 


\ \ 102 


\ 1 1 
• < 


Q 

J 


1 1 0 


43 




1 


74 


17| 


82! 




T 













O 


C 


A 






11 

1 J 


12 


3! 


3! 


V 






'* O 




1 204( 


AQ 


9 
L 




1 

1 










W 








: ZlU 




• 1 

: I 




ft 

: O 


C 
D 










X 




























3j 


Y 


















c;ft 

: DO 










b| 


Z 














































\ 14 


178 


j 178 


2 


! i! 




unknown (?) 




























not sequenced 




























sum of seq' 


1 212 


1 212 


! 212 


1 212 


! 212] 


212 


\ 212 


j 212 


j 212 


| 212 


| 212 


! 212 


j 212 


! 212I 


2121 


oomcaa 1 


| 207 


1 211 


I 1981 210 


! 2041 


102 


\ 49 


1 191 


I 118 


! 58 


! 178 


j 178 


j 94 


| 163j 


85] 


mcaa' 


• 6 


| L 


| E 


j W 


1 V j 


5 


i v 


1 1 


| S 


i Y 






1 D 


j 6 | 


"g""| 


rel. oomcaa 5 


O* 
CO 

i cn 


100% ! 


! # 

: CO 

■ cn 


i .p 

: 0" 
: O 
Ol 


i .0 

: o" 
CD 


*o 
3* 
CO 


i .0 
; S~ 

: CO 
: CN 


\ «P 
0 

CO 


! # 

: CD 

LO 


i O" 
: CN 


! # 

00 


i # 

• 

CO 


: J=> 

: O* 

: ^ 


: 0- 

■ 


#! 
0 ■ 

• 


pos occupied 6 


! 4 


! 2 


\ S\ 3 


j 3 


3 


! 15 


1 9 


! 11 


! 19 


! 5 


I 5 


! 12 


\ 9 


12! 
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CDRII 



amino acid* 


CO 
LO 


IT) 


00 

in 


CO 

in 


o 
to 


5 


CO 


ro 

CO 


co 


ID 
CO 


CO 
CO 


CO 


CO 
CO 


CO 
CO 


o 


A 


& : 


1 j 






174| 


33! 














i! 






R 
D 


i ! 

1 : 






























r 
































n 
u 


1 1 * 




17^ 






160; 




















c 

L 


O : 


3; 








l| 






2 














C 

r 


i ! 




3 


2 
















207! 








a 

VJ 


d 


1 i 

1 ; 






4 


si 








212; 


1! 










u 
1 1 


] \ 




4 


























1 


3- 


37 


2 










8! 










14i 


2081 




K 

IN 


1 j 


61 














199! 




si 










1 

L 


1 


1 


1 i 





1 














1 




ll 




M 
Ivl 


ft 








1 






















N 


51" 




4.! 

* : 






2 






2 














P 


1 


i 
i 






6 


8 


18 




1 














Q 


3 
















2 




2 










R 


5 


A 






5 









6 




201 










S 


48 




1 1 




4 




193 










2 


7 




211! 


T 


42 


97 


5 




7 
















189 




1| 




9 






i 10 


; 7 




204 








1 




3 




w 






9 

: Z 


























X 


4 




1 






1 
1 




















Y 


9 




: 1 O 1 


Z I u 






1 i 










1 








Z 
































































unknown (?) 
































not sequenced 
































sum of seq 1 


! 212 


| 212 


! 212 


| 212 


! 212 


j 212 


| 212 


j 212 


1 212 


| 212 


j 212 


! 212 


| 212 


| 212 


\ 212j 


oomcaa 1 


! 51 


j 97 


! 151 


! 210 


\ 174 


! 160 


j 193 


| 204 


| 199 


| 212 


j 201 


I 207 


! 189 


\ 208 


| 21 1 ! 


mcaa' 


I N 


| T 


j Y 


! Y 


j A 


\ D 


| S 


I v 


\ K 


\ G 




| F 


j T 


| 1 


\ S ; 


ret. oomcaa 1 


j 

: o* 


1 .O 
• 5* 
: t£> 


1 P^ 


I o 
: o 
: CO 
CO 


I # 

CO 


i # 


I ° 
\ CO 


i .o 

: o" 
CD 
CO 


1 ,p 

: O 
: CO 


1 # 
o 
o 


! # 

i to 

CO 


■ 

: o 
: CO 
CO 


: .O 
: o" 
• CO 
: CO 


; .p 

: O 
CO 
: CO 


iioo% 


pos occupied' 


! 19 


\ 12 


1 15 


1 2 


|_9 


j_8 


L_j 


j 2 


6 


j i 


I 4 


! 5 


! 5 


I 3 


\ 2! 



T4" 
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- cm ro ^ to £ £ 



amino acid 1 ^ ^ 



00 cn 



O 

CO 



— CM 

CO 00 



< CO U 



A 


! i i 57! j 




i! 


8l 












1= I 


B 














2! 








C 






















D 


\ 199; 1 381 ! 2!_ 


2| 






l| 








ioj 




P 1 


i si j ! A 




























13! 




























1| 


4! i 


H 




ij 






i! 




2! 




2! 




1 






2! 


2| 








3| 


1] 


ij j 


K 1 


\ j ! i 1861 


6 














3! 




1 1 1 \ \ \ \ 






188; 




209! 




3l 


I: 


i 212 1 




i! I 1 ! 2| 




! io! 


3! 




2| 


- ! 


205| 






N 


: b; l/U: ; *-\ 


I oo 










3 




i si i 


10! 


P 






I 1! 
















Q 


\ \ \ \ 7 












199 








• 

R 


211! i i ! 1 


1 














2! 


8! 


S 


1 i 1 153 i ! 


; io 


! 56 




; 3 








. 6| 


186! 


T 






! 142 








; 1 




4: 


2l 


V 












1 




! 1 






W 






















X 


-hhH- 


\ i 


V 












i 1 




Y 








j 194 












Z 








































unknown (?) 




















Inotsequencec 


! \ i! i! 


















sum of seq ? 
oomcaa 3 
mcaa' 

rel. oomcaa* 

pos occupied 


1 2121 212! 21ll2H|2i; 


i\ 212i 21; 


»!2i: 


i\ 21; 


>j 2i: 


2I 212! 21; 


?! 212 


j 21 2[ 21 


I 211; 199! 170; 153; 181 


si 188] 14; 


i\ i8i 


3! 19' 


ij 20' 


3l 199| 20 


5s 181 


j 186! 21 


PTFoTn \ H K 


| N 
3 ; c 

3 i t 


j T 


i l 


! Y 


! L 


j Q j M 


! N 


! S I L 


\ § ; 45 i 42 ; *s ! ^ 
■ — : co ■ oo ; • c 


?\ i 

t> ■ 

30 : U 


> ; 4 

" : o 
- : O 

3 * a 


A i 
i ■ r~ 


A i 

4 • a 

1 : O 


5 \ 4> : 4 

" : S* • O 
■5 : ^ : r- 
D ; CT> : O 


A # 

^ : tn 

■> ; oc 


! ^\ 9 

: S 
: CO : C 
: CO j ^ 


r '\ 2: 4| 31 


8.L...Z.L-. 


6; 


5; 


5| 


3! 6! 


4| r 


1 7\ 



t<fs> 
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amino acid' 


ro 

CO 


CO 


LD 
CO 


CD 
00 


CO 


CO 
CO 


CD 
CO 


o 

CD 


CD 


CM 
CD 


n 

CO 


CD 


m 

CO 


CO 

CO 


CO 


A 




149| 


ii 




i! 


207 1 










173! 


2| 


15l 


9! 




B 
































c 


















ll 


210! 




5l 


2! 




ij 


D 




5| 




209! 
















2| 


54l 


7j 




E 


1! 




190l 




















1l! 


2! 


11 = 


F . 














ij 




15| 






1| 




9! 


6i 


G 


1| 


l| 


6j 






4 


ij 








2| 


8| 


34! 


261 


35] 


H 




1; 














ij 










3! 


ll! 


1 




8| 










21 












4| 


15; 


io[ 


„..._..* 


30! 






















go! 


4 


3! 


5l 


L 














18j 










1! 


6! 


n| 


7; 


M J 








2I 




ij 














6| 


ij 


N I J 


ij 




ij 
















2! 


20* 


4! 


3! 


P 1 


9| 


















1 


3 


4 


29^ 


10! 


Q 








1 
















5 


3 


9 


2| 


R 


177 






















103 


9 


30 


19| 


S 




ij 






1 














3 


9 


8 


111 


T 


3 


; 28] 






. 207 




j i 








. 25 


15 


7 


6 


20! 


V 




1 9! 










j 187 








I 10 


1 


| 7 


7 




W 


























j 3 


4 


I 3! 


X 








j 1 
























Y 
















1 211 


j 194 








I 12 


\ 9 




Z 


























































| 1 


! 3 


I 4| 


unknown (?) 
































not sequenced 










1 


| 1 


j 1 


\ 1 










j 7 


i 12 


! 13! 


sum of seq' 


! 212 


| 212 


\ 212 


| 212 


i 211 


\ 211 


| 211 


j 211 


| 211 


! 211 


i 211 


I 211 


| 205 


j 200 


I 199! 


oomcaa 1 


| 177 


! 149 


j 190 


| 209 


| 207 


j 207 


j 187 


j 211 


! 194 


I 210 


1 173 


! 103 


! 54 


i 30 


j 35| 


mcaa' 


1 R 


j A 


! E 


I D 


| T 


\ A 


i v 


i y 


j Y 


\ C 


1 A 


! R 


| D 


1 R 


I g "| 


rel. oomcaa* 


j O 

: o 

: n 

CO 


1 ,p 
o 


: O 
: O 
: O 
: CD 


\ „p 

: O 
CD 
CD 


: .0 

: o" 
: CO 
CD 


1 .p 

: o 
CO 
CD 


.0 

: O 
: CD 
CO 


! # 

: O 
O 


• -9 
: o" 
CN 
CD 


1 .0 
: o~ 
O 

1 ° 


• -9 

• 3" 
■ CN 

CO 


1 .© 

: 0 

co 


; 0 

: 0" 
CD 
CM 


• -S 

: O 
: . CD 


': ! 
■ O : 
: CO : 


pos occupied' 


*: C 


.! 10 


1 4 




[\ t 




![ 7 


\ 1 


\ ^ 


! 2 


I 5 


.! 14 


A IE 


j 2C 


>j 21: 
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CDR Ml 



amino acid 1 g£o< C ouou J u_ox--.^o 



A 


7 


13 


! 7 


i 9 


! 6 


2 


! 3 


5 5 


I 5 




! 9 




! 13 




2 


B 
































C 


13 


5 




1 


2 


'■ 11 


! 3 




I 2 










1! 




D 


11 


7 


10 


4 


2 


3 


10 


: 3 


! 3 


! i 




! 3 


! 2 




146 


E 


6 


3 


1 


! 13 




1 


1 


















F 


3 


5 


4 


5 


5 


6 


3 


5 


7 


2 




1 


1 


65| 




G 


34 


17 


: 35 


17 


14 


23 


! io 


5 


1 


5 


3 


1 2 


! 32 




6: 


H 


3 


4 


! 3 


2 


9 


2 




1 


3 


1 


! 2 


! 8 


1 






1 


6 


11 


4 


4 


3 


1 


3 


10 


1 3 


3 


! 2 




1 


2! 


K 


2 


11 






3 


1 




















L 


26 


13 


4 


12 


8 


2 


6 


3 


10 


3 








2! 


i; 


M 




1 


2 
















\ 1 






32; 




N 


4 


6 


4 


3 


2 


2 


6 








; 2 


5 






2| 


P 


6 


5 


5 


6 


9 


8 


2 

1 


3 


_2 


1 





__! 
1 








Q 


4 




1 


1 


1 


1 


R 


4 


10 


9 


7 


5 


5 


2 


3 


1 




1 




2 




4; 


S 


16 


28 


27 


25 


24 


8 


11 


9 


3 




2 


3 


1 




i! 


T 


6 


12 


9 


17 


17 


1 


2 


5 


1 


9 


3 


1 








V 


13 


7 


15 


4 


3 


6 


2 


12 




1 


1 


1 


1 






W 


6 


5 


6 


7 


2 


4 








1 




6 


10 






X 








1 






















1; 


Y 


16 


14 


17 


5 


8 


18 


20 


13 


20 


25 


28 


32 


28| 




Z 


































12 


21 


35 


54 


73 


87 


102 


110 


126 


135 


134 


120 j 


91! 


71! 


21| 


unknown (?) 














3 


2 


1 


1 






3| 


2| 




not sequenced 


14 


14 


14 


14 


15 


19 


21 


22 


23 : 


23 


23 


25! 


25; 


26; 


25; 


sum of seq J 
oomcaa' 
mcaa 4 

rel. oomcaa 1 

pos occupied' 


198 


198 


198 


197 


196 


192 


190 


189 


188j 


188 


188. 


186! 


186| 


185j 186| 


34 


28 


35' 


54 


73 


87: 


102 


110 


126! 


135 


1341 


120; 


91 1 


7lj 146| 

"d 1 

o~ : o~ 1 
CO : CO • 

ro ■ ; 


G 


S 


G 




















cr> - : 


O" * 


jp 
& . 

-3- 


#• 

CO . 


O" 
CM 


.0 
©• 
1^. 
n 


.© 

o* 
in . 


.© ' 
in . 


# 

CO 

in 


# 

CD i 




* 


#1 

in : 
cd ; 


20 


20 


19 


20 


19 


20 


17 = 


14 


14| 


12 


12! 


13.j 


12] 


8j llj 
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Framework IV 



CM CO "2- 

amino acid' 2 2 2 



LO 

o 



CD r-* co 
o o o 



o 



° ^ ^ 52 sum 



A 


1 




1 






2 














B 








1 


















C 


























D 


2! 
























E 










1 
















■F 


2 
























G 






140 




130 




1 












H 


4 
























1 


















1 


1 






K 


























L . 


loj 






r 






91 










2 


M 














6 












N 


r 










1 














P 


17 










1 


1 












Q 








in 


















R 








8 


















S 


7 


1 


















118 


110 


T 












123 


27 




122 






1 


V 


34 




1 






1 




125 




119 






w 




158 






















X 


























Y 


82 
























Z 




























9 


2 


! 2 


- 2 


2 


2 


2 


2 


2 


2 


1 


• 1 


unknown (?) 


























not sequenced 


27 


! 50 


\ 67 


| 75 


1 78 


81 


! 83 


j 84 


86 


| 89 


92 


j 97 


sum of seq J 


! 184 


. 161 


j 144 


| 136 


| 133 


j 130 


| 128 


j 127 


i 125 


! 122 


119 


\ 114j 


oomcaa 3 
mcaa' 

rel. oomcaa 5 

pos occupied* 


j 82 


1 158 


1 140 


| in 


| 130 


| 123 


! 91 


I 125 


1 122 


i 119 


118 


! no! 


1 Y 


j W 


j G 


\ Q 


I 6 


I T 


j L 


! v 


j T 


1 v 


S 


1 s i 


o 

in 


CO 
CD 


; .0 

: o 

• cn 


CN 
CO 


• G* 
: CO 

cn 


■ 

: o 
CD 


I 


: o" 
CO 


i .© 

: o 
: CO 
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\ 18 


! 18 


! 13 


I 1 5 


! 13 


! 10 


I £ 


j 8 


\ I 


)i 4| 4| 
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Framework IV 



amino acid 1 



OOOOOOOO'- 



A 












1 




l 








B 
























C 














i 










D 
























E 














: 










F 














: 










6 






41 




40 


1 














H 


1 
















1 








1 


9 










1 














K 








3l 


















L 


4 












1 J 












M 


























N 












1 














P 
Q 


3 






2 
















2 








29 


















R 


1 






4 






1 












S 


1 






1 














36 


33 


T 








1 




33 


8 




34 








V 


12 














36 




! 36 






W 




46 






















X 


























Y 


16 
























Z 




















































unknown (?) 


























not sequenced 


1 10 


I 11 


1 16 




17 


20 


j 20 


. 21 


21 


! 21 


• 21 


1 22 


sum of seq' 


! 47 


1 46 


1 41 


j 40 


40 


37 


| 37 


36 


36 


| 36 


■■ 36 


j 35 


oomcaa 1 
mcaa 4 

rel. oomcaa 1 

pos occupied 6 


I 16 


1 46 


41 


j 29 


40 


33 


! 19 


: 36 


34 


I 36 


! 36 


! 33. 


\ Y 


j W 


! G 


\ Q 


G 


: T 


j L 


V 


T 


! v 


: S 


! s | 


: _Q 
• o* 
: <^ 

L .£2. 
! 8 


: O" 

•: o 
! ° 

! 1 


\ 

: O" 
: O 
1 ° 

1 


i 

■ o" 
: f> 

• 


• o 
o 


: .O 
: O* 
: CD 
: CO 


: .p 
: O 

i id 


• o 
o 


.p 

o 


j o 

= O 
: O 


: o" 
= O 
: O 


- 

: o* : 
• ^ : 
: (D : 


! 6 


j 1 


1 5 


I 4 


: 1 


! 3 


j 1 


1 


! 2! 



sum 
332 

113 
210 
176 
135 
674 

45 
282 
278 
540 

43 
204 
281 
334 
250 
986 
532 
488 
267 

455 
1 

466 
4 

426 
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Framework I 



acid 1 - Mn^mu)NCoo^2^ 



. A 










1 1 






1 j 


89: 




1 : 
l { 






1 ; 














B 










































C 










































D 




















2| 






















E 


881 


















93; 












92 1 








™7-j 


F 












——■■■! 


j 




















1 j 








G 
















92j 














94 












H 










































1 




































— 


/ / 




K 
























94; 


94; 










L 








I 91 j 




2 


























I 


{ 


M 






















3! 














N 
P 


— . 







j 1 









































i 










94 












™ — j 


Q 


. 3 




92 




1 


90 




















3! 






1 


1 


R 












1 












1 


1 




1 








17 


i 


5 














92 




















94 








T 








































V 




90 






89 








1 




91 




















w 










































X 










































Y 








































[—•■j 


Z 
















































































unknown (?) 






































not sequenced 


5 


1 5 




\ 5 


! ^ 


! 4 


! 4 


1 4 


i 2 


j 2 


! 2 


! 2 


I 2 


! 2 


! 2 


! 2 


! 2 


i 2 




\ ii 


sum of seq 7 


j 92 


j 92 


\ 92j 92 


1 93 


| 93 


\ 93 


! 93 


! 95 


j 95 


j 95 


I 95 


I 95 


j 95 


| 95 


j 95 


! 95 


1 95! 96 


1 96! 


oomcaa 1 
mcaa* 

rel. oomcaa 5 

pos occupied'' 


1 88 


j 90 


! 92! 91 


! 89 


j 90 


| 92 


\ 92 


j 89 


j 93 


I 91 


j 94 


j 94 


! 94 


j 94 


j 92 


j 94 


| 95j 77 


1 96! 


| E 


j V 


\ q| l 


; v 


i Q 


| S 


\ G 


j A 


| E 


j V 


1 K 


! K 


! P 


j G 
j # 

• CD 
:_CD 

! 2 


{ E 

: O 

: o* 

: 

j CD 

1 2 


i S 

: 

: O* 
: CD 

\ ay 
\ 2 


j L j K 

1 *P : 

; o" ; o 
; o ; 3- 
• o; o 

! il 4 


! 1 \ 

'■ -9 i 
: o :• 

: O: 
: O: 

1 i; 


i # 

: CD 

\ 3 
• 


1 # 
; co 
•: en 


100% ! 
99% ! 


; o* 

: tD 
■ CD 


I # 
• o> 


i ^ 

: o 
: CD 
: CD 


• -5 

■ o~ 

: CD 
: CD 


I £ 

: ^* 
\ CD 


; „p 
: o* 
: CO 
: CD 


If 


\ .O 
: o 
i CD 
: CD 


I # 

: CD 
: CD 


\ § 

: CD 
: CD 


1 3 


\ 1! 2 


1 4 


! 3 


1 2 


| 2 


! 4 


| 2 


\ 3 


! 2 


l 2 


I 2 
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CDRI 



amino acid' 


CN 


CM 
CN 


co 

CN 


CN 


CO 
CN 


co 

CN CN 


CO CD 
CN CN 


0 




< 


CO 


CN CO 
CO . CO 


co 


LD CD r*. CO 
CO CO CO CO 


A 








1 2 






I 4 


H M I 


! i si j lj 


B 




















C 




| 96| 












1 1! j i | 




_D_ 












I 2| 




I 21 i ! i 


! Ml \ \ 


i 

uj i 

i 


.„ 








! 2! 






I il I l.l 




F . 




: : 




1 3 


i 1 6 


i 1 97 




I i ! 1 % 






G 






92 




93; 






Ml 1 l.l 




1 72! I j 


H 
















l 1 






! 4! 


-J-' •«* — • «- 

I ! 1 1 ! 1 


_l_ 






i 










I 4 




1 93! | I I 


K . 






! 89 










i 












L 






















1 1 1 


1 1 1 1 2l 1 


M 




1 ! 1 




















1 1 


1 1 1! I 


N f 


! I i 








i 2! 


i 4 


14 






! 2! 






P 








! v 










I ! I ! l| 


Q 




\ 4 
















R 




I 1 






lj 


2! 




Mlh 


! ! 1 1 95| 


S 


94 




1 


90| 




84| • 


10! 


61! 






2! 2 


! 1 si 1 I ! 


_ T 


2 










5! I 


75: 


16l 








2! 


1! i 1 i 


V 


















ij 


I ! 93! I 


w 
















1 ! ! 1 93] 


I i 97! \ 1 


X 




















Y 










[90 






1 1 1 87| j 




z 


































97| 


97! 






unknown (?) 
















not sequenced! 


l! 


1 1 


l! 


i j 

1 : 


lj 


l! 1! 




1 






sum of seq ? i 


96! 


96! 


96! 


96| 


96j 96! 96! 97| 97} 


97] 


97! 


97[ 


971 


97| 97 j 


97! 


97! 97! 97; 97; 


oomcaa 1 j 


941 


96| 


89! 


92 1 


90) 93| 90l 84| 97| 


75j 


61! 


97l 


97| 


87| 93| 


93! 


72! 97! 93! 95! 


mcaa 1 \ 


S | 


c j 


K j 


G| 


S | 


G | Y | 


S I F | 


t 1 


s j 






Y j Wj 


1 | 'G 1 W f V ] R 1 


rel. oomcaa' j 


#! 

CO : 
C7> : 


O : 
O j 


#! 

co : 


-5 
o* : 
tO r 
CT> • 


#! 

CT> ■ 


#! #l 

: ^ • 
cn : O [ 


o~ ; O ; 

: O i 
CO i ^ 1 


#1 

: 
: 


#1 

CO • 
CO • 


^0 ; 

0 • 
O : 
O j 


#i 

O j 
O : 


#1 s\ 

O i CO : 
O ! CO ■ 


#1 

tO : 

cn f 


: ,P \ I \ 

1 9^ 1 >S ; *S •: 

O* • O • ; &■ : 
■ O J CD j CD : 
* «— : CD : CO = 

si ij A 3| 


pos occupied* \ 


2| 


lj 


5: 


3| 


4| 


3! .2! 


7] jl 


5[ 


81 


lj 


lj 


51 4| 


4! 
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Framework II 



acid ^^^-^^-^-^-^■^•^-^" u ^ Lr)Ln Lr)inu0 



A 






1! 






1; 


















1 j 






oil! 1 


B 




































1! ! ! 


C 




























1! 








D 




























14! 








8| 93| j 


E 


— | 


— i 


— i 





3! 






97! 




















: 2: : 


F 




— i 
96! 








: 








2! 










G 








97 












95j 
















H 
1 

K 


— ; 

: 


























3 


l| 






— i 

i! 


— ! 














1] 




75! 


92 






— 


L 

I 




— 


94; 
























f [" j 

-4 f j 


L 














94! 






2! 




2 


1 






— 




M 

N 
P 




92! 
















89! 






1 



































— 
1 












96 








2 














93! 






\ ! 1 i 


_Q 
R 


97 












1 






















, i.. i- ; 




! 1 


















1 


14 












i| i ! 


S 
























j 1 






1 






j 16: ■ 9b; 


T 




j 1 




















! 3 


; 1 




1 








V 




I 2 
















5 


i 1 


1 


j 2 












W 


















! 94 




















X 






































Y 


















1 3 










76 










Z 




































]" "T f : 


































\ 97 


1 97 


1 L L \ 


unknown (?) 






































not sequenced 




































sum of seq J 


j 97 


| 97 


1 97 


| 97 


\ 97 


j 97 


j 97! 97 


! 97 


I 9 7 


I 97! 97! 97! 97 


\ 97 


1 97! 97 


\ 97\ 27\ 97j 


oomcaa 1 
mcaa' 

rel. oomcaa* 

pos occupied" 


| 97 


j 92 


j 96 


! 97 


j 94 


! 96 


| 94! 97 


j 94 


j 89 


! 95 


1 7 5 Li? 

T iTT 


! 76 


1 93 


j 97l 97 


\ 69! 93; 96! 


; Q 


j M 


I p 


I G 


j K 


i 6 


! U E 


j W 


j M 


! G 


\ Y 


1 P 




j G j D j S j 


\ # 

i o 
! ° 


i # 

• in 

: CI 


\ # 

• 01 


li 

: O 


*• -9 

: O" 

: 

: 


i # 

• o 


1 -si £ 
&• o 

• • o 

• o> • — 


i 4? 
i 5* 
■ t^* 

: CT> 


i # 

■ cn 


: O : : .O : 

: O" : 0~ : O* : O 

' CO * r*» ■ LD • CO 

i cn : • cn ; 


: 

: o* 
: <0 

cn 


j 1 OOO/o 

hoo% 


\ .pi : «0 : 

; 3* ; 0 * 0 ; 

: — : CO : CD ■ 
; : Cn : OT 1 j 


j 1 


1 5 


•i 2 


i 1 






j 3] 1 


L ? 


| i 


j 3| 7]jh 


Li 


\ 1; 1 


! 6! 4j _ 2j 
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CDR II 



amino acid Lnu^inu^tocoocDiotocDtocDCOr^r^r^r^r^r^ 



A 




1 6 










j 1 


















j 88 










B 










































C 










1 










! 1 






















D 


77 


















1 2 














1 97 








E 


3 
















2 


















! 2 






F . 








I 2 








I 91 




1 




j 1 




3 














G 


1 


















194 




















•i — t 


H 
1 






















i 15 






















; 4 


1 










1 








3 




1 88 












1 91j 


K 






| 2 






























93 






L 












1 




4 














! 2 












M 




























3 












! 1! 


N 


2 




\ 14 


j 2 


































P 












95 


1 




1 
























Q 


















f91 




81 














1 






R 






78 












3 




1 






1 








1 






5 


2 


2 






95 


1 


95 


1 










1 




1 95 








j 9G 


\ A 


T 
V 




85 


2 




1 
















96 














! 4! 








1 
















93 




2 




9 










W 











































X 




































Y 


12 






92 




















— 





— j 











Z 






































































unknown (?) 










































not sequenced 










































sum of seq 7 
oomcaa' 
mcaa 4 

rel. oomcaa 5 

pos occupied' j 


97 
77 


97! 


97j 


97! 


97! 


97- 


97l 


97! 


97! 


97! 


97! 


97! 


97! 


97= 


97! 


97! 


97! 


97 


1 97! 97; 


85! 


78| 


92! 


95| 


95; 


95! 


91! 


91! 


94! 


81 1 


93! 

V j 


96] 

T] 


88 

71 


95 j 


88! 

~a] 


_97] 

~d'\ 


93 


96! 91! 

TTTI 


D | 

S ' 
o • 

Q> ! 

: 


T | 


R ! 


Y ! 


sj 


p \ 


S | 


F ! 


Q; 


G | 


Q! 


-2 '• 
O* ■ 

00 ; 

CO : 


•9 5 
o 1 

O ■ 
CO : 


o • 
LT) : 
CD : 


,0 • 
O ■ 
00 ■ 
CO • 


s ; 

©"* : 
CO : 
O : 


-9 I 

©" : 
00 : 
O) ■ 


o i 
o" : 
: 

O) 1 


-2 ' 
0* 1 
^ : 
CO : 


j 

O* i 
r->* : 
CD : 


-2 i 

0* : 
CO : 


-9 : 
0 : 
CO • 

ao ; 


-9 : 

0" ; 
cn • 
cn ; 


cn i 


Q 1 
0 : 
00 • 

cn ; 


0 1 
cn I 


O* ; 
O ■ 
O j 


& 
CD 

cn 


,0 : : 
O : O ■ 

CT> J !' 

cn ■ cn ; 


6; 


4j 


__5j 


4j 


3] 




3| 


4 i 


4| 


3| 


3! 


3| 


2I 


5| 


2! 


2j 




4 





/ S>2_ 
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add 1 SP:SScococo<^ 0 Scocococooococ^a>o> 



A 




1 


9ii 
















1 j 


96| 








93! 






B 






































C 














1j 






















951 


D 








l| 




















96; 










E 












1! 










ij 










— | — f 






F . 






> — j 


l! 


























2; 6 




G 














3| 


lj 








j 


— 1 
. — 4 


— 
2 


4| j 






H 


























— L„.„) 
j 9\ 


— I — 
-« -4 — 


— j 
- j 


1 




























K 






















91 






— i 





\ 1 ! 




L 


— 








96j 










97 










! 2 \ 


— f — 


■ — \ 


iyi 







I 


























\ 84| 


N 












2 


2 












2 




p 






1 1 
























— 








Q 












93 
























R 














1 


1 


3 




3 
















S 


87 


j 2 


j 1 


1 








90 


91 








96 




5 








T 


2 


! 94 


1 2 










1 






1 


I 1 


1 




88 








V 






j 2 




1 


















1 










w 














| 95 


















_JL_ 


































Y 








I 94 


























94! 89 




Z 












































































unknown (?) 






































not sequenced 


































l! 2 


i 2! 


sum of seq 7 


j 971 97! 971 97 


j 97 


I 97 


j 97 


1 97 


| 97 


j 97| 97 


j 97 


i 97 


I 97 


I 97! 97! 97: 96! 95| 95] 


oomcaa 1 
mcaa 4 

rel. oomcaa s 

pos occupied 11 


| 87| 94l 9lj 94 


L?6 

|T 


j 93 


j 95 


j 90 


i 91 


| 97i 91 

jTIT 


[96 

)"a" 


[96 

FT 


j 96 

I'd 


88"! 93! 84! 94] 89[ 95| 


|TjT| A 


IT 


! Q 


I w 


i s 


j S 


I T 


I A I Mi Y \ Y \ C ! 


: O : 

L.cn±_cn 

1 4 L: 


: .O 
: O" 
: ^tf" 

: cn 


i .0 

: O 

• 

: CD 


i # 

; cn 


1 .0 
: 0" 

; u> 
• en 


! # 

: CO 
: cn 


: 0* 
: cn 
• cn 


\ <P 

: ^ 

; cn 


•: 0 - 5- 
• O: «d- 

: <- : CO 


i # 

• cn 


i # 

; cn 
• cn 


i ^p 

: O" 

• cn 

• cn 


1 >p 

i a 
\ cn 


■ -p '■ -p 

1 ©~ ! 3" 
: O • 1^. 
: CO • CO 


.0 ; ^0 : 0 ; 
5- : 0; ; O • 

00 ! <tf i O i 


)\ 5 


\ 4 

...... 


; 2 

„• 


! 3 


! 3 


1 5 


j 4 


| ij 5 


1 2 


i 2 


I 2 


\ A 


\ 2 1 5 



/ 513 
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CDR III 



amino acid' 5££g£c3g§< c *^<=>^"-^^ 



A 192 




j i 


1 1 


1 2 




1 3 


I 4 


! 3 


! 2 




1 1 


j 




1 






1 4 




! 2| 


B 










































• C 












1 


| 1 


! 1 






j 2 




i i 
















D 








3 


i 3 


i 3 


! 3 


| 1 


! 2 


1 


| 1 






1 2 




| 1 


i 2 






j 37| 


E 






I i 


! 1 


! 1 


! 2 






| 1 


i 1 








I 1 






I 1 








F 










! 1 




I 3 






i 3 


2 




i 1 












| 2G 




6 






i 


I 9 


n 


! 12 


12 


[ 5 


I 2 


4 


i 3 


L 10 


I 2 


I 1 








Li 






H 






1 10 


! i 




2 






| 1 


1 




! 1 






1 






1 








3 




1 2 


: 2 


1 


1 


4 


! 1 


\ 1 




1 


K 
L 




; 1 


i 


1 




1 


3 


1 
















2 














n 


2 


3 


1 


1 


; 2 


! 5 




• 1 




1 




1 












M 










2 


1 


1 




1 


1 


I 1 


I 1 














! 10 




N 








1 




2 




1 


1 


2 






1 










2 






P 






5 


1 


4 


3 


1 


2 








1 1 


1 


1 


1 












Q 




1 


3 


2 




1 


1 


4 


2 


1 


2 


















3! 


Ft 




92 


7 


9 


2 


2 




2 


1 




2 




















S 




1 


1 


3 


2 


6 


4 


4 


5 


3 


5 


3 


2 


2\ 






1 




1 




T 


1 




1 


3 


2 


1 


2 


6 


3 


3 


6 


i 




l! 














V 

W 


2 




2 


4 


4 




1 




1 


2 






1. 




















1 




2 


1 










1 




2j 




1 




1 


1 






X 










































Y 








1 


6 


3 


6 


9 




7 


2 


1 


2| 


6! 


8 


9! 


9: 


10 




1! 


Z 






















































1 


1 


2! 


8i 


10 


16. 


23| 


30! 


30; 


31 


32! 


3oi 


22 


7| 


2! 


unknown (?) 


























1 






1; 


ll 


1 






not sequenced 


2 


2\ 


52^ 


52! 


52| 


52 1 


52 


521 


52! 


52 


52| 


52! 


52| 


52! 


52! 


52! 


52! 


52! 


53| 


52| 


sum of seq' 
oomcaa' 
mcaa' 

rel. oomcaa* 
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95" 


95j 


45! 


45| 


45! 


45! 


45. 


45l 


45! 


45 


45i 


45! 


45; 


45f 


45| 


45j 
32! 


45] 
30! 


_45 

22| 


44j 

26j 
7] 


_45| 
37| 

~D~j 
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92! 
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9| 
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12 
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9| 


8: 


10. 
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16 


h| 
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8| 
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1 
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D 
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F 
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8| 
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R 








3 
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40 
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T 
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V 
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| 43 
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Y 
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— 
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- 
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52 
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! 56! 


56 
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41 
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41 


i 41 
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1 43 
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40 
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40 
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j 40 
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T 
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404 
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44 
588 
650 
549 
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64 
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1545 
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594 
432 

738 

635 
4 

1678 
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51 
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T 


































68 




: 


V 
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61 


6 
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6| 




6 
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6| 


6| 


6i 
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mcaa 4 
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52i 


52! 


52! 
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52| 


52! 
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68! 
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51 1 
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68j 
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.0 ! 
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O: 
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cn ■ 1- ; 
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1! 


2| 
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1; 


1! 
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fM n ^" CD f*"* CO 
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i| 
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67| 
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68 
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1! 


1] 








i 
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69 1 














3| 


1| 


2\ 










. j 


H 






















— 












1 j 
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2! 
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70 
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— 


3\ 
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f— -•! 
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1 








2 


\ 66! 










70! 








P 
Q 


















































































R 






















2 


j 1 
















! 7A\ 


S 


l 






1 


69 






! 69 




68 


66 




67 




3 




ij 








T 
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2 


| 1 
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V 






1 


! 4 










! 70 










! 6 










2 
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! 74 
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! 69 
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| 69 
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1 70 
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I 74 
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! 74 


\ 74 
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\\ 74 
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rel. oomcaa 1 
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I 67 


\ 68 


j 67 


1 64 
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! 6G 
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I 66 
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1 70 
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! 
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: 
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! a 
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| 0 [ S 
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! N 
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: cn 
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": 5" 
: C> 
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i o 
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■ .p 
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I 
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: o 
: O" 
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\ : 


Si 6 
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| 4 






1 5 


! i 
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amino acid' 


n 


o 


5 


CN 


CO 




LO 
^- 


CD 




CO 

^- 


CD 


O 

to 


in 


CN 
LO 


< 


CO 




CO 


in 


m 

LO 


A 






ij 














1 1 






j 1 




B 






























C 






























D 






























E 












74| 


















F . 






















! 2: 




1 




__G 








74 








74 


1 












H 
























1! i 






1 






: 
: 






























K 


1 




i 1 






























L 


1 








74 




74 
















M 










































N 






































1; 




P 




\ 73 


























Q 


72 




























R 






I 73 












73 






! 72| 




ij 


ij 


S 






1 


" 73! 


















I 1! 


72 




T 




















73 








5j | 


V 






























W 












1 74 
















\ 73| 


X 






























Y 






















72! 


72! ! 






Z 




















































I 74 






unknown (?) § 




























not sequenced! 




























sum of seq' 


74 


74} 


74 


74! 


74 


74 


74 


74j 


74 


74 


74 


74 


74; 


74! 74| 


74] 74 


74: 


74| 


74; 


oomcaa' 


72 


74| 


73 


73 


73 


74 




74] 


74 


74 


74 


73 


73 


72] 


72] 


72] 


74 




6G| 


73 j 


mcaa 4 




'Tj 


p" 


"Tj 


T" 




* r 


T] 


"w 


T 


"g 




"Tj 


"y1 


y] 






Tj 


Tj 


wj 


rel. oomcaa 5 


ay 


#l 

O * 
O : 


CD 
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*p i 

cn : 
cd ; 


CD ■ 
CD 


o 
o • 


# 

o 
o 


i 

o ■ 

O ■ 


# 
o 
o 


O" 

O 
O 


#! 

O : 
O : 


#! 

CD = 
CD ■ 


#! 

CD : 
CD ; 


#! 


i 

O* i 

: 

CD : 


#! 

CDJ 


1000/0 j 


: 

_CD : 


-9 • 
0 • 
CD \ 
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cd ; 
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3 


1j 


2 


2| 


2 


1 


1 


]\ 


1 


1 




2 


2| 


2j 


3| 


3! 


1: 


_3j 


5[ 


2! 
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CDr^cocnO^CNro^i^tDr^coc^O^rNiroyirj 
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A 










73l 


1! 
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1j 






B 
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1| 


































D 
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l| 
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E 


1! 




3j 






7\ 
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F 


7! 








































G 






l| 














8! 






















H 
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1- 
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i\ 
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2 


j 7lj 








r 






K 


i 
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— 
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L 


il 
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2| 








i — 
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1 


















N 


2: 


65! 


1 
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69! 
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1 


1 












i — 


— 
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— 


— 


.... — ... 
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R 
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S 


2 
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73 
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2 
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T 




4 






















69 
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71 


1 


2 
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72 
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Y 


60 
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Z 
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i ~p 

i o" 
! r~- 
• Cf> 
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CD 
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: 
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N 
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4; 
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o 
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1 
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D 






19 


4 


3: 


7 


4 


3 


1! 


6| 


1 


1 


1 
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E 






10 


4 


2 


1 


2 


2 


1! 


2| 












— 


1 








F 
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1 


1 


1 




1 


2 


3 




2 






1 








38 


! 41 


G 


1 
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4 


15 


15 
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8 


6 


2 


5 


1 


8 


6' 


1 
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1 


1 


1 
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1 


1 
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— 
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1 
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5 
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1 
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1 


1 


1 


1 
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1 


8 


4 


2 


3 


2 
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1 
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c 
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3 








P 
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5 
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5 
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1 
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6 
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1 
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3 
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7 
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33 


41 


47 


53 
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57 


56 


5C 
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12 
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Claims 

1 . A method of setting up one or more nucleic acid sequences encoding one or 
more (poly)peptide sequences suitable for the creation of libraries of 
(poly)peptides said (poly)peptide sequences comprising amino acid 
consensus sequences, said method comprising the following steps: 

(a) deducing from a collection of at least three homologous proteins one 
or more (poly)peptide sequences comprising at least one amino acid 
consensus sequence; 

(b) optionally, identifying amino acids in said (poly)peptide sequences to 
be modified so as to remove unfavorable interactions between amino 
acids within or between said or other (poly)peptide sequences; 

(c) identifying at least one structural sub-element within each of said 
(poly)peptide sequences; - - 

(d) backtranslating each of said (poly)peptide sequences into a 
corresponding coding nucleic acid sequence; 

(e) setting up cleavage sites in regions adjacent to or between the ends of 
sub-sequences encoding said sub-elements, each of said cleavage 
sites: 

(ea) being unique within each of said coding nucleic acid sequences; 

(eb) being common to the corresponding sub-sequences of any said 
coding nucleic acids. 

2. A method of setting up two or more sets of one or more nucleic acid 
sequences comprising executing the steps described in claim 1 for each of 
said sets with the additional provision that said cleavage sites are unique 
between said sets. 

3. The method of claim 2 in which at least two of said sets are deduced from the 
same collection of at least three homologous proteins. 

4. The method according to any one of claims 1 to 3, wherein said setting up 
further comprises the synthesis of said nucleic acid coding sequences. 

5. The method according to any one of claims 1 to 4, further comprising the 
cloning of said nucleic acid coding sequences into a vector. 

*/5 
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The method according to any one of claims 1 to 5, wherein said removal of 
unfavorable interactions results in enhanced expression of said 
(poly)peptides. 

The method according to any one of claims 1 to 6, further comprising the 
steps of: 

(f) cleaving at least two of said cleavage sites located in regions adjacent 
to or between the ends of said sub-sequences; and 

(g) exchanging said sub-sequences by different sequences; and 

(h) optionally, repeating steps (f) and (g) one or more times. 

The method according to claim 7, wherein said different sequences are 
selected from the group of different sub-sequences encoding the same or 
different sub-elements derived from the same or different (poly)peptides. 

The method according to claims 7 or 8, wherein said different sequences are 
selected from the group of: 

(i) genomic sequences or sequences derived from genomic sequences; 

(ii) rearranged genomic sequences or sequences derived from 
rearranged genomic sequences; and 

(iii) random sequences. 

The method according to any one of claims 1 to 9 further comprising the 
expression of said nucleic acid coding sequences. 

The method according to any one of claims 1 to 10 further comprising the 
steps of: 

(i) screening, after expression, the resultant (poly)peptides for a desired 
property; 

(k) optionally, repeating steps (f) to (i) one or more times with nucleic acid 
sequences encoding one or more (poly)peptides obtained in step (i). 

The method according to claim 11, wherein said desired property is selected 
from the group of optimized affinity or specificity for a target molecule, 
optimized enzymatic activity, optimized expression yields, optimized stability 
and optimized solubility. 



SUBSTITUTE SHEET (RULE 26) 



WO 97/08320 PCT/EP96/03647 

13. The method according to any one of claims 1 tc 12, whsrein said cleavage 
sites are sites cleaved by restriction enzymes. 

14. The method according to any one of claims 1 to 13, wherein said structural 
sub-elements comprise between 1 and 1 50 amino acids. 

15. * The method according to claim 14, wherein said structural sub-elements 

comprise between 3 and 25 amino acids. 

16. The method according to any one of claims 1 to 15, wherein said nucleic acid 
is DNA. 

17. The method according to any one of claims 1 to 16, wherein said 
(poly)peptides have an amino acid pattern characteristic of a particular 
species. 

18. The method according to claim 17, wherein said species is human. 

19. The method according to any one of claims 1 to 18, wherein said 
(poly)peptides are at least part of members or derivatives of the 
immunoglobulin superfamily. ' 

20. The method according to claim 19, wherein said members or derivatives of 
the immunoglobulin superfamily are members or derivatives of the 
immunoglobulin family. 

21 . The method according to claim 19 or 20, wherein said (poly)peptides are or 
are derived from heavy or light chain variable regions wherein said structural 
sub-elements are framework regions (FR) 1, 2, 3, or 4 or complementary 
determining regions (CDR) 1 T 2, or 3. 

The method according to claim 20 or 21, wherein said (poly)peptides are or 
are derived from the HuCAL consensus genes: 

Vk1, V»c2 t Vk3, Vk4, VX1, VX2, VX3, VH1A, VH1B, VH2, VH3, VH4, VH5, VH6, 
Ck, C\ t CM or any combination of said HuCAL consensus genes. 

The method according to any one of claims 2CF to 22, wherein said derivative 
of said immunoglobulin family or said combination is an Fv f disulphide-linked 
Fv, single-chain Fv (scFv), or Fab fragment. 
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24. The method according to claims 22 to 23, wherein said derivative is an scFv 
fragment comprising the combination of HuCAL VH3 and HuCAL VX2 

consensus genes that comprises a random sub-sequence encoding the, 
heavy chain CDR3 sub-element. 

25. The method according to any one of claims 1 to 24, wherein at least part of 
said (poly)peptide sequences or (poly)peptides is connected to a sequence 
encoding at least one additional moiety or to at least one additional moiety, 
respectively. 

26. The method according to claim 25, wherein said connection is formed via a 
contiguous nucleic acid sequence or amino acid sequence, respectively. 

27. The method according to claims 25 to 26, wherein said additional moiety is a 
toxin, a cytokine, a reporter enzyme, a moiety being capable of binding a 
metal ion, a peptide, a tag suitable for detection and/or purification, or a 
homo- or hetero-association domain. 

28. The method according to any one of claims 10 to 27, wherein the expression 
of said nucleic acid sequences results in the generation of a repertoire of 
biological activities and/or specificities, preferably in the generation of a 
repertoire based on a universal framework. 

29. A nucleic acid sequence obtainable by the method according to any of claims 
1 to 28. 

30. A collection of nucleic acid sequences obtainable by the method according to 
any of claims 1 to 28. 

31. A recombinant vector. obtainable by the method according to any of claims 5 
to 28. 

32. A collection of recombinant vectors obtainable by the method according to 
any of claims 5 to 30. 

33. A host cell transformed with the recombinant vector according to claim 31. 
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34. 
35. 

36. 
37. 

38. 

39. 
40. 



A collection of host cells transformed with the collection of recombinant 
vectors according to claim 32. 

A method of producing a (poly)peptide or a collection of (poly)peptides as 
defined in any of claims 1 to 28 comprising culturing the host cell according to 
claim 33 or the collection of host cells according to claim 34 under suitable 
conditions and- isolating said (poly)peptide or said collection of 
(poly)peptides. 

A (poly)peptide devisable by the method according to any one of claims 1 to 
3, encoded by the nucleic acid sequence according to claim 29 or obtainable 
by the method according to any one of claims 4 to 28 or 35. 

A collection of (poly)peptides devisable by the method according to any one 
of claims 1 to 3, encoded by the collection of nucleic acid sequences 
according to claim 30 or obtainable by the method according to any one of 
claims 4 to 28 or 35. 

A vector suitable for use in the method according to any of claims 5 to 28 and 
35 characterized in that said vector is essentially devoid of any cleavage site 
as defined in claim 1(e) and 2. 

The vector according to claim 38 which is an expression vector. 

A kit comprising at least one of: 

(a) a nucleic acid sequence according to claim 29; 

(b) a collection of nucleic acid sequences according to claim 30; 

(c) a recombinant vector according to claim 31 ; 

(d) a collection of recombinant vectors according to claim 32; 

(e) a (poly)peptide according to claim 36; 

(f) a collection of (poly)peptides according to claim 37; 

(g) a vector according to claim 38 or 39; and optionally, 

(h) a suitable host cell for carrying out the method according to claim 35. 



method of designing two or more genes encoding a collection of two or more 
proteins, comprising the steps of: 
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(a) either 

(aa) identifying two or more homologous gene sequences, or 

(ab) analyzing at least three homologous genes, and 
deducing two or more consensus gene sequences therefrom, 

(b) optionally, modifying codons in said consensus gene sequences 
to remove unfavourable interactions between amino acids in the 
resulting proteins, 

( c ) identifying sub-sequences which encode structural sub- 
elements in said consensus gene sequences 

(d) modifying one or more bases in regions adjacent to or between 
the ends of said sub-sequences to define one or more cleavage 
sites, each of which: 

(da) are unique within each consensus gene sequence, 

(db) do not form compatible sites with respect to any single 
sub-sequence, 

(dc) are common to all homologous sub-sequences. 

42. A method of preparing two or more genes encoding a collection of two or more 
proteins, comprising the steps of : 

(a) designing said genes according to claim 41 , and 

(b) synthesizing said genes. 



43 . A collection of genes prepared according to the method of claim 42. 



44. A collection of two or more genes derived from gene sequences which: 

(a) are either homologous, or represent consensus gene sequences 
derived from at least three homologous genes, and 

2£o 
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(b) carry cleavage sites, each of which: 

(ba) lie at or adjacent to the ends of genetic sub-sequences which 
encode structural sub-elements, 

(bb) are unique within each gene sequence, 

(be) do not form compatible sites with respect to any single sub- 
sequence, and 

(bd) are common to all homologous sub-sequences. 



45. The collection of genes according to either of claims 43 or 44 in which each of 
said gene sequences has a nucleotide composition characteristic of a 
particular species. 



46. The collection of genes according to claim 45 in which said species is human. 



47. The collection of genes according to any of claims 43 to 46 in which one or 
more of said gene sequences encodes at least part of a member of the 
immunoglobulin superfamily, preferably of the immunoglobulin family. 



48. The collection of genes according to claim 47 in which said structural sub- 
elements correspond to any combination of framework regions 1, 2, 3, and 4, 
and/or CDR regions 1, 2, and 3 of antibody heavy chains. 



49. The collection of genes according to claim 47 in which said structural sub- 
elements correspond to any combination of framework regions 1 , 2, 3, and 4 f 
and/or CDR regions 1, 2, and 3 of antibody light chains. 



50. A collection of vectors comprising a collection of gene sequences according 
to any of claims 43 to 49. 
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51. The collection of vectors according to claim 50 comprising the additional 
feature that the vector does not comprise any cleavage site that is contained 
in the collection of genes according to any of claims 43 to 49. 



A method for identifying one or more genes encoding one or more proteins 
having a desirable property, comprising the steps of: 

(a) expressing from the collection of vectors according to either of claims 
50 or 51 a collection of proteins. 

(b) screening said collection to isolate one or more proteins having a 
desired property, 

(c) identifying the genes encoding the proteins isolated in step (b), 

(d) optionally, excising from the genes encoding the proteins isolated in 
step (b) one or more genetic sub-sequences encoding structural sub- 
elements, and replacing said sub-sequence(s) by one or more second 
sub-sequences encoding structural sub-elements, to generate new 
vectors according to either of claims 50 or 51, 

(e) optionally, repeating steps (a) to (c). 



A method for identifying one or more genes encoding one or more antibody 
fragments which binds to a target, comprising the steps of: 

(a) expressing from the collection of vectors according to either of claims 
50 or 51 a collection of proteins, 

(b) screening said collection to isolate one or more antibody fragments 
which bind to said target, 

(c) identifying the genes encoding the proteins isolated in step (b), 

(d) optionally, excising from the genes encoding the antibody fragments 
isolated in step (b) one or more genetic sub-sequences encoding 
structural sub-elements, and replacing said sub-sequence(s) by one or 
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more second sub-sequences encoding structural sub-generate new 
vectors according to either of claims 50 or 51 , 

(e) optionally, repeating steps (a) to (c). 



54*. A kit comprising two or more genes derived from gene sequences which: 

(a) are either homologous, or represent consensus gene sequences 
derived from at least three homologous genes, and 

(b) carry cleavage sites, each of which: 

(ba) lie at or adjacent to the ends of genetic sub-sequences which 
encode structural sub-elements, 

(bb) are unique within each gene sequence, 

(be) do not form compatible sites with respect to any single sub- 
sequence, and 

(bd) are common to all homologous sub-sequences. 



55. A kit comprising two or more genetic sub-sequences which encode structural 
sub-elements, which can be assembled to form genes, and which carry 
cleavage sites, each of which: 

(a) lie at or adjacent to the ends of said genetic sub-sequences, 

(b) do not form compatible sites with respect to any single sub-sequence, 
and 

(d) are common to all homologous sub-sequences. 



ZZ3 
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Figure 1: construction of a synthetic human antibody library based on consensus sequences 
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Figure 6: oligonucleotides for gene synthesis 

OIKl 5'- GAATGCATACGCTGATATCCAGATGACCCAGAG- 
CCCGTCTAGCCTGAGC -3 ' 

01K2 5 ' - CGCTCTGCAGGTAATGGTCACACGATCACCCAC- 
GCTCGCGCTCAGGCTAGACGGGC -3 ' 

01K3 5 ' - GACCATTACCTGCAGAGCGAGCCAGGGCATTAG- 
CAGCTATCTGGCGTGGTACCAGCAG -3 ' 

01K4 5 ' - CTTTGCAAGCTGCTGGCTGCATAAATTAATAGT- 
TTCGGTGCTTTACCTGGTTTCTGCTGGTACCACGCCAG -3 ' 

01K5 5 1 - CAGCCAGCAGCTTGC AAAGCGGGGTCCCGTCCC - 
GTTTTAGCGGCTCTGGATCCGGCACTGATTTTAC -3 ' 

01K6 5 ' - GATAATAGGTCGC AAAGTCTTCAGGTTGCAGGC - 
TGCTAATGGTCAGGGTAAAATCAGTGCCGGATCC -3 ' 

02K1 5 ' - CGATATCGTGATGACCCAGAGCCCACTGAGCCT- 
GCCAGTGACTCCGGGCGAGCC -3 ' 

02K2 5 ' - GCCGTTGCTATGCAGCAGGCTTTGGCTGCTTCT- 
GCAGCTAATGCTCGCAGGCTCGCCCGGAGTCAC -3' 

02K3 5 ' - CTGCTGCATAGCAACGGCTATAACTATCTGGAT- 
TGGTACCTTCAAAAACCAGGTCAAAGCCC -3 ' 

02K4 5 ' - CGATCCGGGACCCCACTGGCACGGTTGCTGCCC- 
AGATAAATTAATAGCTGCGGGCTTTGACCTGGTTTTTG -3 ' 

02K5 5 ' - AGTGGGGTCCCGGATCGTTTTAGCGGCTCTGGA- 
TCCGGCACCGATTTTACCCTGAAAATTAGCCGTGTG -3 ' 

02K6 5 ' - CCATGCAATAATACACGCCCACGTCTTCAGCTT- 
CCACACGGCTAATTTTCAGGG -3 ' 

03K1 5 * - GAATGCATACGCTGATATCGTGCTGACCCAGAG- 
CCCGG -3 ' 

03K2 5 ' - CGCTCTGCAGCTCAGGGTCGCACGTTCGCCCGG- 
AGACAGGCTCAGGGTCGCCGGGCTCTGGGTCAGC -3 ' 

03K3 5 ' - CCCTGAGCTGCAGAGCGAGCCAGAGCGTGAGCA- 
GCAGCTATCTGGCGTGGTACCAG -3' 
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Figure 6: (continued) 

03K4 5 ' - GCACGGCTGCTCGCGCC ATAAATTAAT AGACGC - 
GGTGCTTGACCTGGTTTCTGCTGGTACCACGCCAGATAG -3 ' 

03K5 5 ' - GCGCGAGCAGCCGTGCAACTGGGGTCCCGGCGC- 
GTTTTAGCGGCTCTGGATCCGGCACGGATTTTAC -3 1 

03K6 5 ' - GATAATACACCGCAAAGTCTTCAGGTTCCAGGC - 
TGCTAATGGTCAGGGTAAAATCCGTGCCGGATC -3 1 

04K1 5 ' - GAATGCATACGCTGATATCGTGATGACCCAGAG- 
CCCGGATAGCCTGGCG -3' 

04K2 5 ' - GCTTCTGCAGTTAATGGTCGCACGTTCGCCCAG- 
GCTCACCGCCAGGCTATCCGGGC -3 ' 

04K3 5 1 - CGACCATTAACTGCAGAAGCAGCCAGAGCGTGC- 

TGTATAGCAGCAACAACAAAAACTATCTGGCGTGGTACCAG - 
3 ' 

04K4 5 ' - GATGCCCAATAAATTAATAGTTTCGGCGGCTGA- 
CCTGGTTTCTGCTGGTACCACGCCAGATAG -.3 ' 

04K5 5 * - AAACTATTAATTTATTGGGCATCCACCCGTGAA- 

AGCGGGGTCCCGGATCGTTTTAGCGGCTCTGGATCCGGCAC- 
3 ' 

04K6 5 ' - GATAATACACCGCCACGTCTTCAGCTTGCAGGG- 

ACGAAATGGTCAGGGTAAAATCAGTGCCGGATCCAGAGCC 
3 ' 

OILl 5 1 - GAATGCATACGCTCAGAGCG.TGCTGACCCAGCC- 
GCCTTCAGTGAGTGG -3 ' 

01L2 5'- CAATGTTGCTGCTGCTGCCGCTACACGAGATGG- 
TCACACGCTGACCTGGTGCGCCACTCACTGAAGGCGGC - 3 ' 

01L3 5 ' - GGCAGCAGCAGCAACATTGGCAGCAACTATGTG- 
AGCTGGTACCAGCAGTTGCCCGGGAC -3 ' 

01L4 5 1 - CCGGCACGCCTGAGGGACGCTGGTTGTTATCAT- 
AAATCAGCAGTTTCGGCGCCGTCCCGGGCAACTGC -3 ' 

OILS 5 ' - CCCTCAGGCGTGCCGGATCGTTTTAGCGGATCC- 
AAAAGCGGCACCAGCGCGAGCCTTGCG -3 ' 
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Figure 6: (continued) 

01L6 5'- CCGCTTCGTCTTCGCTTTGCAGGCCCGTAATCG- 
CAAGGCTCGCGCTGG -3 ' 

02L1 5 ' - GAATGCATACGCTCAGAGCGCACTGACCCAGCC- 
AGCTTCAGTGAGCGGC -3 ' 

02 L2 5 ' - CGCTGCTAGTACCCGTACACGAGATGGTAATGC- 
TCTGACCTGGTGAGCCGCTCACTGAAGCTGG -3 ' 

02L3 5 ' - GTACGGGTACTAGCAGCGATGTGGGCGGCTATA- 
ACTATGTGAGCTGGTACCAGCAGCATCCCGG -3 ' 

02L4 5 ' - CGCCTGAGGGACGGTTGCTCACATCATAAATCA- 
TCAGTTTCGGCGCCTTCCCGGGATGCTGCTGGTAC -3 ' 

02L5 5 * - CAACCGTCCCTCAGGCGTGAGCAACCGTTTTAG- 
CGGATCCAAAAGCGGCAACACCGCGAGCC -3 ' 

02L6 5 1 - CCGCTTCGTCTTCCGCTTGCAGGCCGCTAATGG- 
TCAGGCTCGCGGTGTTGCCG -3 ' 

03L1 ,5 ' - GAATGCATACGCTAGCTATGAACTGACCCAGCC- 
GCC TTCAGTGAGCG -3' 

03L2 5 ' - CGCCCAGCGCATCGCCGCTACACGAGATACGCG- 
CGGTCTGACCTGGTGCAACGCTCACTGAAGGCGGC -3 ' 

03L3 5 ' - GGCGATGCGCTGGGCGATAAATACGCGAGCTGG- 
TACCAGCAGAAACCCGGGCAGGCGC -3 ' 

03L4 5 ' - GCGTTCCGGGATGCCTGAGGGACGGTCAGAATC- . 
ATCATAAATCACCAGAACTGGCGCCTGCCCGGGTTTC -3 ' 

03L5 5 ' - CAGGCATCCCGGAACGCTTTAGCGGATCCAACA- 
GCGGCAACACCGCGACCCTGACCATTAGCGG -3 ' 

03L6 5 ' - CCGCTTCGTCTTCCGCCTGAGTGCCGCTAATGG- 
TCAGGGTC -3 ' 

01246H1 5'- GCTCTTCACCCCTGTTACCAAAGCCCAG- 
GTGCAATTG -3 ' 

01AH2 5 ' - GGCTTTGCAGCTCACTTTCACGCTGCTGCCCGG- 
TTTTTTCACTTCCGCGCCAGACTGAACCAATTGCACCTGGGC- 
TTTG -3 ' 
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Figure 6: (continued) 

01AH3 5 ' - GAAAGTGAGCTGCAAAGCCTCCGGAGGCACTTT- 

TAGCAGCTATGCGATTAGCTGGGTGCGCCAAGCCCCTGGGCAG 
GGTC -3' 

01AH4 5 ' - GCCCTGAAACTTCTGCGCGTAGTTCGCCGTGCC- 

AAAAATCGGAATAATGCCGCCCATCCACTCGAGACCCTGCCC- 
AGGGGC -3' 

01AH5 5 ' - GCGCAGAAGTTTCAGGGCCGGGTGACCATTACC- 

GCGGATGAAAGCACCAGCACCGCGTATATGGAACTGAGCAGCC 
TGCG -3' 

01ABH6 5 ' - GCGCGCAATAATACACGGCCGTATCTTCGCT- 
ACGCAGGCTGCTCAGTTCC -3 ' 

01BH2 5 ' - GGCTTTGCAGCTCACTTTCACGCTCGCGCCCGG- 

TTTTTTCACTTCCGCGGCGCTCTGAACCAATTGCACCTGGGC- 
TTTG -3 ' 

01BH3 5 ' - GAAAGTGAGCTGCAAAGCCTCCGGATATACCTT - 

TACCAGCTATTATATGCACTGGGTCCGCCAAGCCCCTGGGCAG 
GGTC -3 ' 

01BH4 5 ' - GCCCTGAAACTTCTGCGCGTAGTTCGTGCCGCC- 

GCTATTCGGGTTAATCCAGCCCATCCACTCGAGACCCTGCCCA 
GGGGC -3 ' 

01BH5 5 ' - GCGCAGAAGTTTCAGGGCCGGGTGACCATGACC- 

CGTGATACCAGCATTAGCACCGCGTATATGGAACTGAGCAGCC 
TGCG -3 ' 

02H2 5 ' - GGTACAGGTCAGGGTCAGGGTTTGGGTCGGTTT- 

CACCAGGGCCGGGCCGCTTTCTTTCAATTGCACCTGGGCTTTG 
-3 ' 

02H3 5 ' - CTGACCCTGACCTGTACCTTTTCCGGATTTAGC- 

CTGTCCACGTCTGGCGTTGGCGTGGGCTGGATTCGCCAGCCGC 
CTGGGAAAG -3 ' 

02 H4 5 ' - GCGTTTTCAGGCTGGTGCTATAATACTTATCAT- 

CATCCCAATCAATCAGAGCCAGCCACTCGAGGGCTTTCCCAGG 
CGGCTGG -3 ' 
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Figure 6: (continued) 

02 H5 5 ' - GCACCAGCCTGAAAACGCGTCTGACCATTAGCA- 
AAGATACTTCGAAAAATCAGGTGGTGCTGACTATGACCAACAT 
GG -3 ' 

02H6 5 ' - GCGCGCAATAATAGGTGGCCGTATCCACCGGGT- 
CCATGTTGGTCATAGTCAGC -3 ' 

03H1 5 ' - CGAAGTGCAATTGGTGGAAAGCGGCGGCGGCCT- 
GGTGCAACCGGGCGGCAG -3 ' 

03 H2 5 ' - CATAGCTGCTAAAGGTAAATCCGGAGGCCGCGC - 
AGCTCAGACGCAGGCTGCCGCCCGGTTGCAC -3 ' 

03H3 5 ' - GATTTACCTTTAGCAGCTATGCGATGAGCTGGG- 
TGCGCCAAGCCCCTGGGAAGGGTCTCGAGTGGGTGAG -3 ' 

03 H4 5 ' - GGCCTTTCACGCTATCCGCATAATAGGTGCTGC- 
CGCCGCTACCGCTAATCGCGCTCACCCACTCGAGACCC -3 ' 

03 H5 5 " - CGGATAGCGTGAAAGGCCGTTTTACCATTTCAC- 
GTGATAATTCGAAAAACACCCTGTATCTGCAAATGAACAG- 3 ' 

03H6 5 ■ - CACGCGCGCAATAATACACGGCCGTATCTTCCG- 
CACGCAGGCTGTTCATTTGCAGATACAGG -3 ' 

04H2 . 5 ' - GGTCAGGCTCAGGGTTTCGCTCGGTTTCACCAG- 
GCCCGGACCACTTTCTTGCAATTGCACCTGGGCTTTG -3 ' 

04 H3 5 ' - GAAACCCTGAGCCTGACCTGCACCGTTTCCGGA- 
GGGAGCATTAGCAGCTATTATTGGAGCTGGATTCGCCAGCCGC 
-3' 

04H4 5 ' - GATT ATAGTTGGTGCTGCCGCTATAATAAATAT - 
AGCCAATCCACTCGAGACCCTTCCCAGGCGGCTGGCGAATCCA 
G -3' 

04H5 5 ' - CGGCAGCACCAACTATAATCCGAGCCTGAAAAG- 
CCGGGTGACCATTAGCGTTGATACTTCGAAAAACCAGTTTAGC 
CTG -3 ' 

04H6 5 ' - GCGCGCAATAATACACGGCCGTATCCGCCGCCG- 
TCACGCTGCTCAGTTTCAGGCTAAACTGGTTTTTCG -3 ' 
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Figure 6: (continued) 

05H1 5'- GCTCTTCACCCCTGTTACCAAAGCCGAAGTGCA- 
ATTG -3 ' . 

05H2 5 ' - CCTTTGCAGCTAATTTTCAGGCTTTCGCCCGGT- 

TTTTTCACTTCCGCGCCGCTCTGAACCAATTGCACTTCGGCTT 
TGG -3 ' 

05H3 5 ' - CCTGAAAATTAGCTGCAAAGGTTCCGGATATTC - 

CTTTACGAGCTATTGGATTGGCTGGGTGCGCCAGATGCCTGG 
-3' 

05H4 5'- CGGAGAATAACGGGTATCGCTATCGCCCGGATA- 

AATAATGCCCATCCACTCGAGACCCTTCCCAGGCATCTGGCGC 
AC -3 ' 

05H5 5 ' - CGATACCCGTTATTCTCCGAGCTTTCAGGGCCA- 

GGTGACCATTAGCGCGGATAAAAGCATTAGCACCGCGTATCTT 
C -3 1 

05H6 5 ' - GCGCGCAATAATACATGGCCGTATCGCTCGCTT- 
TCAGGCTGCTCCATTGAAGATACGCGGTGCTAATG -3 ■ 
06H2 5 ' - GAAATCGCACAGGTCAGGCTCAGGGTTTGGCTC- 

GGTTTCACCAGGCCCGGACCAGACTGTTGCAATTGCACCTGG- 
GCTTTG -3 ' 

06H3 5 ' - GCCTGACCTGTGCGATTTCCGGAGATAGCGTGA- 

GCAGCAACAGCGCGGCGTGGAACTGGATTCGCCAGTCTCCTGG 
GCG -3 ' 

06H4 .5 ' - CACCGCATAATCGTTATACCATTTGCTACGATA- 

ATAGGTACGGCCCAGCCACTCGAGGCCACGCCCAGGAGACTG- 
GCG -3 * 

06H5 5'- GGTATAACGATTATGCGGTGAGCGTGAAAAGCC - 

GGATTACCATCAACCCGGATACTTCGAAAAACCAGTTTAGCCT 
GC -3 ' 

06H6 5 ' - GCGCGCAATAATACACGGCCGTATCTTCCGGGG- 
TCACGCTGTTCAGTTGCAGGCTAAACTGGTTTTTC -3' 

OCLK1 5 ' - GGCTGAAGACGTGGGCGTGTATTATTGCCAGCA- 

GCATTATACCACCCCGCGGACCTTTGGCCAGGGTAC -3' 
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Figure 6: (continued) 

OCLK2 5 ' - GCGGAAAAATAAACACGCTCGGAGCAGCCACCG 
TACGTTTAATTTCAACTTTCGTACCCTGGCCAAAGGTC -3 ' 

OCLK3 5 ' - GAGCGTGTTTATTTTTCCGCCGAGCGATGAACA 
ACTGAAAAGCGGCACGGCGAGCGTGGTGTGCCTGCTG -3 ' 

OCLK4 5 ' - CAGCGCGTTGTCTACTTTCCACTGAACTTTCGC 
TTCACGCGGATAAAAGTTGTTCAGCAGGCACACCACGC -3 ' 

OCLK5 5 1 - GAAAGTAGACAACGCGCTGCAAAGCGGCAACAG 
CCAGGAAAGCGTGAGCGAACAGGATAGCAAAGATAG -3 ' 

OCLK6 5 * - GTTTTTCATAATCCGCTTTGCTCAGGGTCAGGG 
TGCTGCTCAGAGAATAGGTGCTATCTTTGCTATCCTGTTCG 
3 ' 

OCLR7 5 ' - GCAAAGCGGATTATGAAAAACATAAAGTGTATG 
CGTGCGAAGTGACCCATCAAGGTCTGAGCAGCCCGGTG -3 1 

OCLK8 5 ' - GGCATGCTTATCAGGCCTCGCCACGATTAAAAG- 
ATTTAGTCACCGGGCTGCTCAGAC -3 1 

OCH1 5'- GGCGTCTAGAGGCC AAGGCACCCTGGTGACGGT - 
TAGCTCAGCGTCGAC -3 ' 

OCH2 5'- GTGCTTTTGCTGCTCGGAGCCAGCGGAAACACG- 
CTTGGACCTTTGGTCGACGCTGAGCTAACC -3 ' 

. OCH3 5 ' - CTCCGAGCAGCAAAAGCACCAGCGGCGGCACGG- 
CTGCCCTGGGCTGCCTGGTTAAAGATTATTTCC -3 ' 

OCH4 5 ' - CTGGTCAGCGCCCCGCTGTTCCAGCTCACGGTG- 
ACTGGTTCCGGGAAATAATCTTTAACCAGGCA -3 ' 

OCH5 5 ' - AGCGGGGCGCTGACCAGCGGCGTGCATACCTTT- 
CCGGCGGTGCTGCAAAGCAGCGGCCTG -3 ' 

OCH6 5'- GTGCCTAAGCTGCTGCTCGGCACGGTCACAACG- 
CTGCTCAGGCTATACAGGCCGCTGCTTTGCAG -3 ' 

OCH7 5 ' - GAGCAGCAGCTTAGGCACTCAGACCTATATTTG- 
CAACGTGAACCATAAACCGAGCAACACC -3 ' 

OCH8 5 ' - GCGCGAATTCGCTTTTCGGTTCCACTTTTTTAT- 

CCACTTTGGTGTTGCTCGGTTTATGG -3 ' 
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Figure 11: Expression analysis of initial library 
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Figure 12: Increase of specificity during the panning rounds 



O 
CO 
LL 

>» 
-t— ^ 

'o 
~"o 

0 
Q. 
CO 




Panning Round 



SUBSTITUTE SHEET (RULE 26) 
60 / 204 



WO 97/08320 



PCT/EP96/03647 




WO 97/08320 PCT7EP96/03647 



Figure 1 4: Competition ELISA 
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Figure 16: Purification of fluorescein binding scFv fragments 
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Figure 17: Enrichment factors after three rounds of panning 
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Figure 19: Selectivity and cross-reactivity of HuCAL antibodies 
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Figure 25a: List of unique restriction sites used in or suitable for HuCAL genes or pCAL vectors 
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Figure 3Sb: List of oligonucleotides used for synthesis of modules 

M1 : PCR using template 
NoVspAatll: TAGACGTC 

M2: synthesis 

BloxA-A: TAT6A6ATCTCATAACTTCGTATAATGTAC6CTATAC6 - 
AAGTTAT 

BloxA-B: TAATAACTTCGTATAGCATACATTATACGAAGTTATG- 
AGATCTCA 

M3: PCR. NoVspAatll as second olign 

XloxS-muta: CATTmTGCCCTCGTTATCTACGCATGCGATAACTTCGTA- 
TAGCGTACATTATACGAAGTTATTCTAGACATGGTCATAGCTGTTTCCTG 

M7-I: PCR 

gill NEW-fo w : GGGGGGAATTCGGTGGTGGTGGATCTGCGTGCGCTG - 
AAACGGTTGAAAGTTG 

glllNEW-rev: CCCCCCCAAGCTTATCAAGACTCCTTATTACG 
M7-II: PCR 

glllss-fow: GGGGGGGGAATTCGGAGGCGGTTCCGGTGGTGGC 
M7-III: PCR 

glllsupernew-fow: GGGGGGGGAATTCGAGCAGAAGCTGATCTCT- 
GAGGAGGATCTGTAGGGTGGTGGCTCTGGTTCCGGTGATTTTG 
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Figure 35b: List of oligonucleotides used for synthesis of modules (continued) 

M8: synthesis 

lox514-A: CCATAAC7TCGTATAAT6TACGCTATACGAA6TTATA 
lox51 4-B: AG CTTATAACTTCGTATAG CGTACATTATACG AAGT- 
TATGGCATG 

M9II: synthesis 

M9ll-fow: AGCTTGACCTGTGAAGTGAAAAATGGCGCAGATT- 
GTGCGACAI 1 1 1 1 1 1 IGTCTGCCGTTTAATTAAAGGGGGGGT 
M9ll-rev:GTACACCCCCCCCCAGGCCGGCCCCCCCCCCCCTTTAA- 
TTAAACGGCAGACAAAAAAAATGTCGCACAATCTGCG 

M10H: assembly PCR with template 

bla-fow: GGGGGGGTGTACATTCAAATATGTATCCGCTCATG 

bla-seq4: GGGTTACATCGAACTGGATCTC 

blal -muta: CCAGTTCGATGTAACCCACTCGCGCACCCAACTGATC- 

CTC AG C ATCmTAGTTTC ACC 

blall-muta: ACTCTAGCTTCCCGGCAACAGTTAATAG ACTGGATG - 
GAGGCGG 

bla-NEW: CTGTTGCCGGGAAGCTAGAGTAAG 

bla-rev: CCCCCCCTJAATTAAG GGGG G GGGCCG G CCATTATCAAA - 

AAG G ATCTCAAG AAG ATCC 

Ml 1 11/111: PCR. site-directed mutagenesis 
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Figure 35b: List of oligonucleotides used for synthesis of modules (continued) 

fl-fow: GGGGGGGGCTAGCACGCGCCCTGTAGCGGCGCAnAA 

fl-rev: CCCCCCCTGTACATGAAATTGTAAACGTTAATATTTTG 

f 1 -t1 33.muta: GGGCGATGGCCCACTACGAGAACCATCACCCTAATC 

M12: assembly PCR using template 

p1 5-fow: GGGGGGAGATCTAATAAGATGATCTTCTTGAG 

p1 5-NEWI: GAGTTGGTAGCTCAGAGAACCTACGAAAAACCGCCCTG- 

CAAGGCG 

p1 5-NEWII : GTAGGTTCTCTGAGCTACCAACTC . 

pi 5-NEWIII: GTTTCCCCCTG GCG G CTCCCTCCTG CG CTCTCCTGTTCCT- 

GCC 

pi 5-NEWIV: AGGAGGGAGCCGCCAGGGGGAAAC 
p15-rev: GACATCAGCGCTAGCGGAGTGTATAC 

Ml 3: synthesis 

BloxXB-A: GATCTCATAACTTCGTATAATGTATGCTATACGAAGTTA- 
HCA 

BloxXB-B: GATCTGAATAACTTCGTATAGCATACATTATACGAAGTTA- 
TGAGA 

Ml4-Ext2: PCR. site-directed mutagenesis 

ColEXT2-fow: GGGGGGGAGATCTGACCAAAATCCCTTAACGTGAG 

Col-mutal: GGTATCTGCGCTCTGCTGTAGCCAGTTACCTTCGG 
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Figure 35b: List of oligonucleotides used for synthesis of modules (continued) 

Col-rev: CCCCCCCGCTAGCCATGTGAGCAAAAGGCCAGCAA 

Ml 7: assembly PCR using template 
CAT-1: GGGACGTCGGGTGAGGTTCCAAC 
CAT-2: CCATACG G AACTCCGG GTG AGCATTCATC 
CAT-3: CCG G AGTTCCGTATGG 
CAT-4: ACGTTTAAATCAAAACTGG 

CAT-5 : CCAGTTTTGATTTAAACGTAGCCAATATG G ACAACTTCTTC- 

GCCCCCGTTTTCACTATGGGCAAATATT 

CAT-6: GGAAGATCTAGCACCAGGCGnTAAG 



M41 : assembly PCR using template 

LAC1 : GAGGCCGGCCATCGAATGGCGCAAAAC 

LAC2: CGCGTACCGTCCTCATGGGAGAAAATAATAC 

LAC3: CCATGAGGACGGTACGCGACTGGGCGTGGAGCATCTGGTCGCA- 

TTG GGTCACCAGCAAATCCG CTGTTAG CTGG CCCATTAAG 

LAC4: GTCAGCGG CG GG ATATAACATGAGCTGTCCTCGGTATCGTCG 

LAC5: GTTATATCCCGCCGCTGACCACCATCAAAC 

LAC6: CATCAGTGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGT4TTG - 

GGAGCCAGGGTGGTTTTTC 

LAC7 : GGTTAATTAACCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCC- 
AGCTG CATCAGTG AATCG G CCAAC 

M41-MCS-fow: CTAGACTAGTGTTTAAACCGGACCGGGGGGGGGCTT- 
AAGGGGGGGGGGGG 
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Figure 35b: List of oligonucleotides used for synthesis of modules [continued) 

M41 -MCS-rev: CTA6CCCCCCCCCCCCTTAAGCCCCCCCCCGGTCC66T- 
TTAAACACTAGT 

M41-fow: CTAGACTAGTGTTTAAACCGGACCGGGGGGGGGCTTAA- 
GGGGGGGGGGGG 

M41 -rev: CCCCCCCTTAAGTGGGCTGCAAAACAAAACGGCCTCC- 
TGTCAGGAAGCCGCTTTTATCGGGTAGCCTCACTGCCCGCTTTCC 
M41 -A2: GTTGTTGTGCCACGCGGTTAGGAATGTAATTCAGCTCCGC 
M41 -B1 : AACCG CGTGGCACAACAAC 

M41 -B2: CTTCGTTCTACCATCGACACGACCACGCTGGCACCCAGTTG 

M41 -CI : GTGTCGATGGTAGAACGAAG 

M41 -Cll: CCACAGCAATAGCATCCTGGTCATCCAGCGGATAG7T- 

AATAATCAGCCCACTGACACGTTGCGCGAG 

M41 -Dl : G ACCAGGATGCTATTGCTGTGG 

M41 -Dll: CAGCGCGATTTGCTGGTGGCCCAATGCGACCAGATGC 

M41 -El : CACCAGCAAATCGCGCTG 

M41 -Ell: CCCGGACTCGGTAATGGCACGCATTGCGCCCAGCGCC 
M41-FI: GCCATTACCGAGTCCGGG 

M42: synthesis 

Eeo-H5-Hind-fow: AATTCCACCATCATCACCATTGACGTCTA 
Eeo-H5-Hind-rev: AGCTTAGACGTCAATGGTGATGATGGTGG 



SUBSTITUTE SHEET (RULE 25} 



187/204 



PCT/EP96/03647 



CD 
CO 



CD 

CQ 



CD 
CD 

^£ 

CD ^ 
- — m 

cd 8 
3 ^ 



CD 

^rv. cd 
t- CO co 

h— t— CO 
CO 



CO 



CO 

o 



_ ^ LU 



x: *o on 



8- 

CQ 



CQ 



CD 

UO CD 

CO ^ O 

t- CO 



^ -s s 

t- oo n £ 



lo CO - £ 
°- co D> 1 

CO CD 

CQ LU 

CN t- 
r- O N 
CN 



CD 

I 

CO 



CO 
O 

CO 



CL 
JQ 

CD 
00 



s2 



CD 
CQ 



CD 

5-"-. 



0> 

CO CM 
r- CO 



CO 



CD 



£: CQ 

CL co 
co 
CQ 



_ -^CD CD 



m ^ 



CD 
CM 



Jo ^ § = ^TcD § ^ 

in — — _ rn^rr 



SUBSTITUTE SMEET (RULE 25) 
188/204 



WO 97/08320 



PCT/EP96/03647 



H 

>1 

C/l 



H I 

H ? 

LD ? 

w ? 

P-i ? 



H 

cn 
o 

o 
o 
u 
w 



H 

X 

4J 
W 



H 

m 
w 



H 
3 



> 

w 
s 

H 

-H 
W 



H 
U 

a 



H 
M 
W 
4-> 



H 

a 



< Eh 

o u 

u o 

E-i ' 



Eh . 
O U 
U O 
<C Eh 
Eh < 
O U 

U O 

U O 

U O 

Eh < 

O O 

O U 

< Eh 

< Eh 
U O 
O O 

O O 
O O 
Eh < 
U O 
U O 
U 0 
U O 
O O 
Eh 
Eh 

U O 
U O 

< Eh 

o u 

Eh rfj 
O O 
O O 

u q 

Eh < 

u o 
u o 

Kd Eh 

23 Eh 
Eh 
Eh 

o o 
u o 
o o 
u o 



H ? 

g I 

? 



H 

CD 

w 



H 



H 

PQ 

w 
m 
> 

CO 



o u 
o o 

< Eh 

u o 
u o 

< Eh 
Eh < 

o u 
a o 
u o 

u q 

Eh < 

< Eh 
O O 

o u 

Eh < 

o o 



< 

O 

Eh 
< 



Eh 
O 

Eh 



O O 

O O 

< Eh 
Eh 

< Eh 

O u 

<C Eh 



<C Eh 

< Eh 
U O 
O U 
Eh < 
O U 

< Eh 
U O 

eh <: 
u o 

o o 
u o 

< Eh 

o o 

Eh «< 

o o 

Eh < 

o o 
o o 
o u 



Eh < 

u o 



Eh 



o u 
o u 

Eh 
Eh 
Eh 
Eh 

< Eh 

u o 



< 

Eh 



Eh 

< 

Eh 

Eh < 

< Eh 
O U 

< Eh 
O U 



Eh 



< 

Eh 



U O 
Eh < 

o a 
o u 

Eh 
Eh 
Eh 
Eh 

rtj" Eh 

o u 
o o 
o u 

Eh 

< Eh 

Eh <£ 
Eh < 
O U 
U O 

< B 
U O 
Eh <J 
O O 

Eh 
Eh 
Eh 
Eh 

o o 



CM 



1> 



VD 
CM 



SUBSTITUTE SHEET (RULE 26) 
189/204 



WO 97/08320 



PCT/EP96/03647 



O 
o 



"a 
o 
E 

o 



re 

E 

re 

o 
re 



QJ 

cr 



re 

Q. 
re 

E 

re 
c 
o 



to 
n 



u cd 



Eh 



< 

En 



CD CJ 

<C Eh 

Eh < 

U CD 

U CD 

< Eh 

u cd 

Eh < 



vo 

CM 
CO 



u cd 

O U 

O U 

< Eh 
O U 
Eh < 
CD U 

< Eh 
U O 
Eh < 

< Eh 

< Eh 
Eh 

Eh . 

U O 

o a 

Eh < 

< Eh 

< Eh 

O p 

O CD 

< Eh 
Eh 
Eh 
O U 

< Eh 
U O 

< Eh 
CD CJ 
Eh 




Eh < 

O cj 

< Eh 

O O 

Eh < 



Eh 
< 
Eh 



Eh 

Eh 
< 
Eh 



cj o 

cj o 

Eh < 

a o 

< Eh 

cd u 

Eh < 

U CD 

u o 

o u 



vo 
n 



U CD 

Eh < 

CD O 

Eh < 

o q 

Eh < 

< Eh 

O o 

CJ O 

o o 

<: Eh 

u q 

Eh < 

u q 

Eh < 

< Eh 

Eh < 

CJ O 

CJ O 

< En 



VO 
CN 



Eh < 

CD U 

< Eh 
CJ O 

CJ o 

u o 

u o 

CD CJ 

O CJ 

Eh < 

u o 

Eh < 

< Eh 
CJ O 
U CD 
<C Eh 
Eh 
Eh 
CJ O 
CD U 

O U 

O U 

< Eh 
O CJ 
0 O 
O CJ 
U CD 

' Eh 
< 
Eh 



< 



cd u 
u o 



Eh 



Eh 



CJ O 
Eh 
Eh 

< 
Eh 



Eh 

<; 

CD CJ 

< £ 

Eh < 

O q 

Eh < 

O q 

u o 

Eh < 

cd o 

o o 

u o 



VO 

<H1 



CJ 


CJ 


Eh 




cd 


o 


Eh 


% 


< 


Eh 


CJ 


o 


o 


O 




Eh 


Eh 


< 




Eh 


<< 


Eh 


U 


O 


Eh 




o 


u 


Eh 




Eh 


< 


Eh 




u 


a 


< 


Eh 


u 


o 


o 


U 


Eh 


< 




Eh 


O 


CJ 


o 


O 


o 


u 


cj 


o 


Eh 


<< 


Eh 


< 


O 


CJ 


CJ 


o 




Eh 


O 


CJ 




Eh 


O 


CJ 




CJ 


CJ 


O 




Eh 


CJ 


CD 


o 


O 


< 


Eh 


o 


CJ 




cd 


rj 


CD 






ro 


rj 


r > 


C!) 






CO 


rj 


CO 

vJ 


(J 


r * 

V—/ 


o 


r ^ 


CD 




Fh 
l. 1 


r j 


CO 


r > 


CO 


CO 


CJ 


r * 
v»y 




CO 


rj 


r » 


CD 


CO 


CJ 


< 


Eh 




Eh 


co 


n 




Eh 


< 




CO 


CJ 


o 


r j 


CO 


CJ 


a 


ro 




CO 


o 


V—/ 




CO 


u 




CO 


CJ 

VJ 


a 


CO 






< 


i_ • 


r \ 
\~J 


CO 


Eh 




r \ 


CO 


< 




CO 


CJ 


TG 


U 


< 


Eh 


< 


CJ 


CD 




Eh 


CJ 


o 


5 


Eh 




Eh 


CJ 


O 




Eh" 


o 


O 




Eh 


Eh 


< 




< 


U 


o 




Eh 


O 


u 




Eh 


vo 








CN 




i> 




in 




m 





Eh < 

O U 

< Eh 

< Eh 
Eh < 
CD CJ 

< Eh 
CD U 

< Eh 
Eh <C 

U CD 

cd a 

< Eh 

< Eh 

CD U 
O O 
O U 
CJ CD 
U CD 
O O 

Eh 
Eh 
CD U 

Eh < 

u o 

Eh 

< Eh 
Eh 
Eh 
< 

Eh < 

U CD 

Eh < 

O CJ 

< ^ 

CJ CD 

u q 

Eh < 

< Eh 
U CD 

U O 

Eh < 

U CD 

U O 

O U 

U CD 

CJ CD 

Eh < 
Eh < 



CN 



Eh < 
<Eh 

O CD 

o u 

cd a 

< Eh 

u o 

< Eh 
Eh < 

u o 



CD CJ 

< Eh 

u o 

u o 

CD O 

U CD 

Eh 
Eh 

CD U 

< Eh 



vo 



SUBSTITUTE SHEET (RULE 26) 
190/204 



WO 97/08320 



PCT/EP96/03647 



cj cd 

u cd 

cj o 
Eh 



O U 

CD CJ 

U O 

CJ CD 

Eh < 

u o 
cd u 

< En 
CJ O 
Eh 

Eh . 

< Eh 
CJ O 
Eh 
Eh 

U O 
O U 
O O 
Eh < 

< Eh 
Eh < 
CD CJ 
O O 
Eh 
Eh 

Eh < 
O CJ 
CJ CD 
Eh < 
CD U 
U CD 
Eh < 
CJ CD 

o CJ 

u o 

< Eh 

cj cd 

Eh < 

cd cj 

Eh < 

cd u 
cd u 

Eh <C 

cd cj 
u o 



3 



<; Eh 

cj cd 

u cd 

a cd 

cj cd 

a cd 



Eh 



< 

Eh 



cd cj 

Eh < 

<: Eh 

u cd 

< - 

Eh 
Eh 

o cj 

< Eh 

cd cj 
u cd 
o cj 

cd cj 
cj cd 



Eh 

< 



Eh 



O U 
CJ CD 

3 Eh 



Eh 

Eh . 

CD U 

Eh < 

CD U 

< Eh 

cj cd 

CD U 
U CD 

u cd 

o u 
cd u 

Eh ' 
Eh 

CD U 

< Eh 
<<3 Eh 
Eh < 
CD CJ 

< Eh 

< Eh 
O CJ 
<C Eh 
O CD 
Eh < 

O U 

Eh ' 
Eh 

O U 

cj cd 



Eh 



Eh 



s 



Eh 



O O 
CJ O 
U O 
Eh < 
CJ O 

a cd 

Eh < 

cd o 

o o 
u o 

Eh 
Eh 
U CD 

a o 

Eh < 

cj cd 
o cj 
<: Eh 



a cd 
u cd 
cd o 

Eh < 
< Eh 

a cd 

Eh < 
CD CJ 

Eh < 
CJ CD 
<C Eh 
Eh 
Eh 

cj o 

Eh < 
U O 
Eh 
Eh 



Eh 
Eh 

< 
Eh 
U O 
O CJ 
Eh < 

a o 

<C Eh 
U CD 

O CJ 
< Eh 
CJ O 
CD CJ 
O CJ 
Eh <: 

<: _ 

Eh 
Eh 
CD CJ 

cd cj 

Eh < 

<C Eh 

CJ CD 

Eh < 

U CD 

<C Eh 

u cd 



< Eh 

cd cj 

Eh < 

cj cd 

Eh 
Eh 

<C Eh 

O CD 

Eh < 

CD CJ 



3 



Eh 

. Eh 

CJ O 

O O 

< Eh 
*J Eh 
O CD 
Eh < 
CJ CD 

< Eh 

Eh < 

O U 

< Eh 
CD U 
Eh < 
CD U 
CD CJ 
Eh < 
CJ CD 

< Eh 




Eh 
< 



< 
Eh 



cd a 

Eh < 
< Eh 
CD O 
<d Eh 
Eh 
Eh < 
CD U 
U CD 
CJ CD 



< Eh 

CD CJ 
CD CJ 
CD CJ 
CJ CD 

< Eh 
Eh < 

< Eh 

< Eh 
CJ CD 

Eh < 
CD CJ 
CJ CD 
CD CJ 
CD CJ 
CJ CD 
CJ CD 
CJ CD 
CD CJ 
Eh < 

Eh < 

CJ CD 

Eh < 

CJ CD 

CD CJ 
Eh 



Eh 

CD CJ 

< Eh 
CD CJ 

CJ CD 

a cd 

< Eh 
CD CJ 

a cd 

CD O 
CD CJ 
CJ CD 
O CJ 
Eh < 

< Eh 
Eh < 
CD CJ 

Eh < 

CD CJ 

< Eh 
< 
Eh 
Eh 

CD CJ 




CJ CD 
Eh < 

< Eh 
O CD 
Eh < 
U O 
CD CJ 
Eh < 
CD CJ 

< Eh 




•< Eh 
CJ CD 
CD CJ 



Eh 
< 



Eh 
Eh 



CJ CD 

U CD 
U O 

CD CJ 
CJ CD 
CD CJ 
CJ CD 
CJ CD 
< Eh 
< 
Eh 
Eh 
< 



Eh < 
CD O 

< Eh 
CJ CD 
CJ CD 
Eh < 

< Eh 
CD CJ 

< Eh 
CD CJ 

Eh < 
Eh ^ 
CD CJ 
Eh < 
U CD 
CD CJ 
CJ CD 
CJ CD 

< Eh 
Eh < 

Eh < 

u o 

Eh < 

< Eh 
CD U 
CD U 
^ Eh 

< Eh 

U CD 
Eh < 

u o 

Eh < 

CJ CD 

Eh 
Eh 

Eh 
Eh 

CD cj 

CJ CD 
.CD CJ 

CD CJ 
CD CJ 
CD CJ 

a cd 

E^ 

u o 

Eh < 
Eh < 



CM 
00 



O 

00 



CNI 
Ch 



I> 



SUBSTITUTE SHEET (RULE 28) 
191/204 



o 



O 
O 



WO 97/08320 



PCT/EP96/03647 




Eh < 

< Eh 
U O 

CD CJ H 

< Eh I- 

CJ CD in 

Eh < O 

Eh < U 

CJ CD W 

Eh < 



< Eh 
CD CJ 
Eh < 
U CD 
d Eh 

< Eh 

cj cd 
cj cd 

CJ CD 

< Eh 

U CD 
CD CJ h 
Eh < 

a cj 

U CD 



CJ CD 
«< Eh 
CJ O 

o o 

U CD 
Eh 



Eh 
Eh < 
CD CJ 



Eh 



W 
(0 
CO 



Eh 



CD CJ 
U CD 
Eh < 



CD CJ 

<C Eh 

CD CJ 

Eh < 

CD CJ 

CD CJ 

CD CJ 

Eh < 

CJ CD 

Eh < 

Eh *3 

CD CJ 

CJ O 

CD CJ 

< Eh 
U CD 
CJ CD 

< Eh 




Eh < 
CJ O 
< Eh 



< 
Eh 



U CD 

Eh < 

CJ CD 

<C Eh 

Eh < 




CD CJ 

CD CJ 

U O 
< 

U CD 

< E^ 

CD CJ 

CJ O 

CD CJ 

CD CJ 

O CJ 

Eh < 

O CJ 

CD CJ 

CD CJ 



CJ CD < Eh 



Eh 






J 


M ? 


CD 


CJ 




< 


Eh 


H 


I 


H 


<C 


Eh 




CJ 


CD 


O 




X 


CD 


CJ 




< 


Eh 




? 


w 


CJ 


CD 




Eh 


< 


X 


} 


W 


Eh 


< 




< 


Eh 




i 


PQ 


U 


CD 




CD 


CJ 








CD 


CJ 




CD 


CJ 








O 


U 




CJ 


CD 








Eh 






CD 


CJ 








< 


Eh 












H I 








< 


Eh 






d) ) 


< 


Eh 




CD 


CJ 






0) ) 


Eh 


S 




Eh 


< 






< ? 


Eh 






< 


Eh 








< 


GT 




CJ 


CD 








CJ 




Eh 


< 






H ? 


a 


O 




CJ 


CD 






(D t 


CD 


U 




Eh 


< 






,Q J 


CJ 






CD 


u 






PQ ) 


CD 


r \ 
\J 




Eh 


< 




j 




CD 


r \ 
KJ 


H 








1 








CD 


Eh 


< 


H 


J 






r~ » 
fcH 


^-i 


< 


Eh 


jj 


J 




u 




in 


EH 


< 


W 


) 




o 


CJ 


PQ 




< 




} 




Fh 


<C 






CJ 




} 




CJ 


r l"i 


H 




u 






H I 


o 


r \ 


H 

f l 




cj 






w i 




Eh 




< 










CO 


U 


CO 










rn > 


M 


CD 




u ■ 


< 






03 ? 


<d 


Eh 




< 


Eh 






j 


U 


CD 














CD 


U 












i i i 


CJ 


CD 






<! 








CJ 


O 




S 










CD 


CJ 




r ) 


CD 






r-M * 


CD 


U 






o 








CJ 


CD 




Eh 








Eh 


< 




S 


u 








CJ 


O 






rj 








< 


Eh 




Eh 










Eh 






Eh 










TG 


CJ 




< 


Eh 








< 




Eh 


% 










Eh 




Eh 










$ 


Eh 


H 


< 


% 








CD 


U 


H 


Eh 










Eh 




rc 




Eh 








Eh 




CO 


$ 


Eh 








Eh 




CO 


CJ 


CD 








< 


Eh 


PQ 

















<N 


O 


CM 




rH 


CN 


CN 


ro 






tH 


rH 


rH 



SUBSTITUTE SHEET (RULE 26) 
192/204 



WO 97/08320 PCT/EP96/03647 




SUBSTITUTE SHEET (RULE 26) 
193/204 



WO 97/08320 



PCT/EP96/03647 



Figure 37: Oligo and primer design forVicCDR3 libraries 
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Figure 37: Oligo and primer design for Vk CDR3 libraries 
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Figure 37: Oligo and primer design for Vk CDR3 libraries 
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Figure 37: Oligo and primer design for Vk CDR3 libraries 
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Figure 38: Oligo and primer design forVX CDR3 libraries 
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Figure 38: Oligo and primer design for W. CDR3 libraries 
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Figure 38: Oligo and primer design for VX CDR3 libraries 
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Figure 38: Oligo and primer design for Vk CDR3 libraries 
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